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Introduction 


A current theory of genetics assumes that deoxyribonucleic acid comprises 
the gene, and that the different genes are discriminated from each other 
by the seriation of the nucleotides along the long axis of the DNA mole- 
cule. The principal basis for this theory is the fact that treatment with a 
DNA preparation isolated from a normal culture can transform a culture 
carrying a defective gene into one carrying the normal gene. It has been 
inferred, therefore, that DNA comprises the gene and that the different 
genes are different from each other because of the different arrangements 
of nucleotides within them. This inference has been supported by the 
fact that destruction of DNA by a specific enzyme destroys the capacity 
of a DNA preparation to effect transformation of a defective into a normal 
gene. 

Transformation is a directed mutation of a specific defective gene into 
the corresponding normal gene by a chemical substance. The transforma- 
|tion experiments with DNA preparations (which were previously assumed 
‘to have been achieved by a purified soluble capsular carbohydrate) are 
‘now assumed to be achieved by treatment with the DNA isolated from the 
organism carrying the dominant gene. It has been inferred as a result of 
'these experiments that DNA, and DNA alone, is the transforming agent. 
In studies of the melezitose locus of Saccharomyces, transformation. is 
achieved without DNA by the use of maltose and melezitose. In previous 
‘experiments transformations of a defective galactose locus had been 
achieved by the use of galactose. It has been inferred as a result of the 
DNA transformations that DNA carries some “genetic information”. If 
this inference is justified we may infer that the sugars, maltose, melezitose 
and galactose also carry “genetic information” (Lindegren & Pittman, 
1959): 

The hypothesis that the gene is composed exclusively of DNA does not 
account for the facts: (1) that transformations can be achieved by the use 
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of saccharides without DNA, (2) that most of the chromatin in the cell] 
(possibly as much as go per cent) is heterochromatin which does not carr | 
genes (the large heterochromatinic Y-chromosome in Drosophila is nearly} 
gene-free), and (3) that euchromatin does not always carry genes. (The 
B-chromosome in maize contains both euchromatin and heterochromatin 
but does not carry genes.) | 

As a substitute for the theory that DNA alone comprises the gene, lf 
propose (1) that DNA has the specific function of controlling homologous} 
synapsis of chromosomes, (2) that the genes are located on the surface off 
the DNA, (3) that in spite of their fundamental stability genes are to some 
extent malleable and can be remodelled and (4) that the transformations] 
achieved by maltose, melezitose and galactose are effected by remodelling} 
the surface of the gene. If genes can be remodelled by substrate, it seem 
reasonable to suppose that genes act as receptors of substrate to initiate th 
synthesis of enzymes (or as receptors of gene-products to terminate enzym 
synthesis). 

The discrimination between euchromatin and heterochromatin has been 
studied by Jack Schultz (1947) and I shall summarize his views briefly 
Euchromatin is the region of the chromosome which carries the Mende-4 
lian genes; heterochromatin does not carry Mendelian genes except for ai 
few which might be supposed to have been intercalated into the hetero- 
chromatin by translocation. Euchromatin is much poorer in deoxy- 
ribonucleic acid than heterochromatin; heterochromatin tends to remai 
compact and heavily staining after telophase when the euchromatin has. 
spun out into diffusely staining uncoiled threads. The heterochromatin has 
been called “‘prochromosomes” or “‘chromocentres” and said to be} 
“heteropycnotic” or “pycnotic” indicating its denser character. It hasif 
been inferred that there are two kinds of heterochromatin: the a-type 
which does not uncoil and the B-type which uncoils into a diffuse mesh 
visible in the chromocentres of the giant salivary and nerve cell chromo- 
somes. 

The euchromatinic regions of the giant salivary chromosomes of 
Drosophila are divided into bands and interband regions (Fig. 1). The 
heavy bands are richer in DNA than the interbands and contain both 
RNA and lower polymers of DNA. Although the proteins are difficult to 
distinguish in the bands, the presence of arginine in the bands indicate: 
that a basic protein is present there. The interbands carry little nucleic: 
acid but consist mainly of proteins containing aromatic amino acids. One 
type of protein in the interband stains with fast green and is easily soluble 
in slightly acid solutions while the other is (1) unstaining, (2) resistant to’ 
acid and (3) highly elastic. These two proteins may be the histone- and 
tryptophane-proteins of Mirsky, Pollister & Ris (1946). In most hetero-| 
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chromatin the interbands are absent (although the Y-chromosome in 
D. buscki (Krivshenko, 1950) has faint bands and interbands) and the 
DNA “capsules” are connected to each other by strands of the same kind of 
nucleoprotein of which they are themselves composed; in euchromatin 
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Fic. 1. Diagrammatic representation of the chromosome containing euchromatin, 
heterochromatin and the nucleolar connecting region. 


the bands (or capsules), which are rich in DNA, are connected by inter- 
band strands of elastic protein. 

The heterochromatinic regions of the different chromosomes of D. 
melanogaster all coalesce into an apparently homogeneous association at 
meiosis, although this does not occur in maize nor in most of the other 
Diptera (White, 1954). In all species the chromomeres of the euchromatin 
synapse in pairs, at meiosis, homologous chromomere with homologous 
chromomere, excluding the third chromonema in triploids. (Although the 
third chromonema is excluded at meiosis, all three chromosomes fuse in 
the salivaries of triploid Drosophila.) The precise pairing of euchromatinic 
chromomeres and the not-so-precise pairing of heterochromatin or 
heterochromatinic knobs indicate that each euchromatinic chromomere is 
unique and tends to associate in synapsis with its homologous chromomere 
but that the chromomeres of heterochromatin are not so distinctive, at 
least with regard to synapsis, for they occasionally associate in a non- 
homologous manner. 
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The Function of Heterochromatin 


Schultz has summarized the current concept of the function of hetero-. 
chromatin and has made some inferences of his own: A male Drosophila 
without the heterochromatinic Y-chromosome is sterile but otherwise 
normal. Males that have lost the major heterochromatinic region of the 
X-chromosome are fertile if they contain a Y-chromosome. Many of the 
rearrangements on X-chromosomes in Drosophila produce viable males 
that are sterile. When both breaks in a rearrangement involve only the 
heterochromatinic regions the male is usually fertile; when the X is broken 
in a euchromatinic region the male is usually fertile, but when the X is 
broken in a euchromatinic region and translocated to an autosomal euchro- 
matinic region the males are sterile. From this Schultz infers that the 
heterochromatins in different regions have similar functions. The hetero- 
chromatin of the Y-chromosomes is essential for normal spermatogenesis. |ff 

Schultz points out that rearrangements between euchromatin and hetero- 
chromatin lead to variegation of the characters controlled by the genes 
which are brought into the proximity of heterochromatin, indicating that 
heterochromatin changes the stability of the gene. With regard to the 
effect of heterochromatin on variegation, Schultz says, ‘“The essential fact 
is that the regions containing the genes for which variegation is displayed | 
show regular changes that are correlated in their frequency and extent 
with the frequency and extent of the variegation.”” This inference is sup- 
ported by data from yeast which indicate that the internal chromosomal | 
environment, especially as it concerns the availability of adenine, affects | 
the stability of the gene (Lindegren & Lindegren, 1956). Schultz points | 
out that in the absence of the Y-chromosome the bands in the euchromatin | 
stain lightly and the nucleoprotein components are affected not only in 
amount but in arrangement as well, and he concludes that ‘‘the hetero- 
chromatic regions function in what may provisionally be termed, for lack | 
of a more precise statement, as the generalized metabolism of the 
chromosome.” 

Schultz has inferred that heterochromatin is involved in the synthesis 
of nucleolar substance. In both the salivary band nuclei and the maize 
chromosomes the nucleolus is associated with heterochromatin. Schultz, 
Caspersson & Aquilonius (1940) were able to show that “the nucleoli 
contain ribosenucleo-proteins which were shown to vary in composition 
according to the genetic structure of the animal. This is the cardinal fact 
which lies at the base of the theory that the heterochromatic regions have 


to do with cytoplasmic synthesis in general, by way of precursors deriving 
from the nucleoli.” 
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An adequate supply of heterochromatin (1) ensures fertility, (2) controls 
variegation and (3) ensures the synthesis of an adequate supply of nucleolar 
material. ‘The nucleolar material is assumed to transmit the effects of gene 
action to the cytoplasm and there to produce the specific gene-effect, pre- 
sumably in the form of an enzyme. 


The Biological Function of DNA 


The theory that DNA comprises the genetic material and that crossing- 
over rearranges the nucleotides in the DNA implies that the synapsis of 
the chromonemata is effected by homologous nucleotide-to-nucleotide 
pairing of the strands of DNA. It is difficult to understand, however, (1) 
how one double helix could attract another, (2) why the attraction should 
be exerted only at meiosis, and (3) why only single strands should attract 
only other single strands, i.e. why pairing is always in two’s and multiple 
synapses do not occur in euchromatin. I shall propose (1) that DNA does 
not comprise the genic material, (2) but that DNA forms a part of the 
chromosome on which the genes are carried, (3) that synapsis is effected 
by the “zippering-up” of the bases of homologously ‘‘coded’’ single 
helices in a narrow band on one side of the euchromatinic regions of the 
chromosome at zygotene, (4) that the alignment of nucleotides is relatively 
permanent and not altered by crossing-over or mutation, but (5) that the 
alignment serves to achieve precisely homologous synapsis of the euchro- 
matin in which the genes are carried and thus preserves the order of the 
genes by assuring that crossing-over will almost always be precisely 
reciprocal thus avoiding the production of either deletions or duplications 
in euchromatin. 

The 2-by-2 pairing of homologous chromonemata was demonstrated by 
Newton & Darlington (1929) in a study of the synapsis of triploids. They 
found that, when three strands synapsed at leptotene, only 2 of the 3 chro- 
monemata were paired at any given point. The exclusion of the third 
chromonema in the pairing of trisomics and triploids indicates that each 
chromonema has only a single pairing valence or pairs only on one side. 
Darlington supposed that the 2-by-2 pairing was an electrical phenomenon 
but this fails to account for its predominantly homologous nature, Linde- 
sren & Bridges (1938) proposed that synapsis might be explained by the 
possible antigenic nature of the chromomeres but this might be expected 
to lead to multiple associations. 

Euchromatin and heterochromatin synapse quite differently. ‘The gene- 
-arrying euchromatin of normal chromosomes and the gene-free euchro- 
natin of the B-chromosome both undergo precisely homologous pairing. 
in contrast with euchromatin (either gene-carrying or gene-free), the 
yeterochromatin of different nonhomologous chromosomes in the salivary 
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chromosomes of Drosophila fuses in a multiply associated unaligned mass. 
This suggests that the DNA of heterochromatin is not so precisely arranged 
as that of euchromatin. The heterochromatinic regions fuse at meziosis, 
but not at other times, suggesting that the DNA of heterochromatin is J 
present in single helices at meiosis. The capacity of DNA for effecting ]} 
precise homologous pairing of the gene-bearing euchromatin ensures 
precisely reciprocal crossing-over, thus maintaining the genome com- 
pletely intact while permitting exchanges between different sections of the 
gene-bearing euchromatin. 


The B-Chromosome of Maize 


McClintock (1933) studied the synapsis of trisomic B-chromosomes in 
maize. Figure 2 is a reproduction of six of her figures describing the 
association at meiosis of three B-type chromosomes in a maize plant. 
Concerning the B-type chromosome, Rhoades and McClintock: (1935) 
state: “There has been found, however, in certain strains of maize, 
especially Black Mexican sweet corn, a type of supernumerary chromo- jf 
some which is totally unlike any members of the regular complement. |J 
It cannot be said, on the basis of its peculiar morphology, to have been J 
derived from any one member of the monoploid set and its origin is un- 
known. This type of supernumerary has been called the B-type chromo- | 
some in contradistinction to members of the normal complement which 
Randolph (1928) has designated as the A-type chromosome. The B-type 
has a distinctive morphology quite unlike any of the A-type chromosomes. 
It appears to be composed of genetically inert material and carries no 
known genes. The presence of one or many B-type chromosomes has no 
visible effect on the morphological character of the plant.” (This proves 
that the B-chromosome does not carry genes.) ‘‘Randolph has succeeded 
through successive crosses in accumulating more than twenty-five B-type 
chromosomes in a single plant in addition to the regular complement of | 
twenty. In contrast with supernumeraries composed of A-type chromo- 
somes the B-type is readily transmitted through both pollen and eggs. 

“The morphology of the B-type chromosome at pachytene and its | 
synaptic behavior have been investigated by McClintock (1933). In the | 
meiotic prophase the B-type is slightly more than one-half the length of | 
the shortest member of the normal complement. Its morphology at mid- 
prophase, beginning with the terminal insertion region, is as follows: (1) 
terminal spindle fiber attachment region, (2) pycnotic (heterochromatic) | 
region, (3) chromatic (euchromatic) region composed of small but distinct 
chromomeres, (4) elongate pycnotic region with several definite con- 


strictions, (5) bulging pycnotic region, (6) broken pycnotic region com- 
posed of four distinct parts.” 
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Fic. 4. The association of three B-type chromosomes. In this figure only 
homologous association has occurred. Plant 536-2. 


Fic. 5. The association of three B-type chromosomes to produce a T 
configuration. Homologous association produced the two side arms. The 
upper arm resulted from nonhomologous association. Plant S 31-314-2b. 


Fig. 6. The association of three B-type chromosomes to produce an 
asymmetrical T configuration. The 2-by-2 association in the arm to the 
right is nonhomologous. Plant S 31-314-2b. 


Fic. 2a. Figures 4 to 6 from McClintock’s 1933 paper. Mag. < 1650. (By courtesy of 
Springer-Verlag, Heidelberg). 


(I have numbered these six regions in McClintock’s Fig. 4. The only 
euchromatinic region in this chromosome is region 3, which Rhoades 
and McClintock called a “chromatic” region. Regions 2, 4, 5 and 6 are 
heterochromatinic regions for which they use the older term “pycnotic’’.) 
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Fic. 7 


Fic. 9 


Fic. 7. The association of three B-type chromosomes. Only homologous 
parts are associated 2-by-2. The univalent threads to the lower right and 
left have remained unassociated. Plant 536-2. 

Fic. 8. The association of three B-type chromosomes. The chromosomes 
are numbered 1, 2 and 3. Chromosome 1 associated at the spindle fiber 
attachment end with chromosome 2. Chromosome 1 associated at the 
opposite end with chromosome 3. The rest of chromosome 2 is in the form 
of a foldback. Such figures are comparatively frequent. Plant 536-2. 

Fic. 9. The association of three B-type chromosomes to form a complex 
configuration. Such complex trivalent configurations have been observed 
occasionally in plants trisomic for a member of the monoploid complement. 
Plant 536-2. 


Fic. 2b. Figures 7 to 9 from McClintock’s 1933 paper. Mag. x 1650. (By courtesy of 
Springer-Verlag, Heidelberg). 
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“The greater part of the B-type at mid-prophase is composed of pycnotic 
material. As stated before, there is reason to’believe that the B-type chro- 
mosome is genetically inert. That these two facts have some close relation- 
ship is suggested by Heitz’s studies (1933, 1934) with Drosophila in which 
he shows that the pycnotic portions found in prophase chromosomes are 
genetically inert. 

“The B-type shows no synaptic affinity for any of the chromosomes 
composing the normal set. If a single B-type is present it behaves as a 
univalent but regions within it are often non-homologously paired at mid- 
prophase in meiosis. Synapsis occurs between B-type chromosomes if 
two or more are present in the same nucleus although non-homologous 
association is very common. 

“McClintock found that in plants with two B-types there were more 
sporocytes with two univalent B-type chromosomes at diakinesis than 
there were in mid-prophase where the B-types were usually paired. This 
has been attributed to the complete separation of the two members of a 
B-type bivalent during diplotene and early diakinesis which occurs in 
some sporocytes. This precocious separation may be due to frequent 
occurrence of non-homologous association observed at pachytene and/or 
to partial failure of chiasma formation when homologous pairing does 
occur.” 

The legends under McClintock’s figures describe the association of 
three B-type chromosomes and point out that the association is always in 
pairs, the third chromatid being excluded in any region where two have 
already synapsed. It was in the study of the synapsis of B-chromosomes 
that she discovered the phenomenon of nonhomologous synapsis; she says, 
in private communication, that it is more conspicuous but probably not 
more common between paired B-type than between paired normal chro- 
mosomes. If one assumes that these six figures give a good sample of the 
different kinds of nonhomologous synapsis, it is clear that nonhomologous 
synapsis of euchromatin to euchromatin is rare. For example, the non- 
homologous synapsis in her Fig. 5 occurs in the upward region of the 
inverted T and is a nonhomologous synapsis of heterochromatin to hetero- 
chromatin, and euchromatin to heterochromatin; nonhomologous synapsis 
of euchromatin to euchromatin does not occur. Similarly, in her Fig. 6 the 
euchromatin synapses homologously but in one part of the T there is a 
synapsis of two nonhomologous heterochromatinic regions, together with 
a fold back in the small terminal heterochromatinic region designated as 2. 
But the fold back does not lead to homologous synapsis in the dotted 
euchromatinic region which remains single. In all the other figures the 
same pattern holds: if nonhomologous synapsis occurs it is either between 
two nonhomologous heterochromatinic regions or between a heterochro- 


116 CARL C. LINDEGREN 


Og 


Leptonema Zygonema Pachynema 
Diplonema Diakinesis 


Metaphase I 


h\\ @ CS 
YS @ 


Anaphase I Telophase I 
Interphase 
SZ EQ= 
ee OC— /S$SaX 

— = 

eee ee 

Metaphase II Anaphase IL 
Prophase IT 


Fic. 3. The stages of meiosis. Highly schematized diagram of meiosis in which a 
single pair of chromosomes is followed. (Figure 1 from Rhoades’s 1950 paper, by courtesy J} 
of the American Genetic Association.) 


matinic and a euchromatinic region but not between two nonhomologous J 
euchromatinic regions. On the basis of these figures, one might infer that | | 
the synapsis in a euchromatinic region is generally homologous, whether ] 
or not that region contains genes, and that synapsis between heterochro- ] 
matinic regions may be either homologous or nonhomologous with a 
general preference for homologous synapsis. It may be inferred that the. 
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coding of DNA in euchromatin ensures precise pairing but that the coding 
in heterochromatin does not invariably ensure precise pairing. I propose 
that the synapsis in euchromatin is due to the exposure of a narrow band 
of single helices of DNA and that it is this narrow band which ensures 
that only two strands are associated in any given region, while the third 
strand is excluded. On this proposal, one can account for both the pre- 
cision of the pairing and the fact that only two strands are involved at any 
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Centromere 


Nucleolus 


Fic. 4. The ten maize chromosomes. Schematic diagram of pachytene chromosomes: 
each bivalent chromosome is represented by a single line in order to simplify the diagram. 
(Figure 2 from Rhoades’s 1950 paper, by courtesy of the American Genetic Association.) 


point. One function of heterochromatin, in addition to those proposed by 
Schultz, may be to supply an adequate reserve of nucleotides to prevent 
the occurrence of deficiencies in euchromatin and thus to preserve the 
continuity of the genome. If the reserve nucleotides were present in short 
chains of open helices, this would explain the tendency of heterochromatin 
for nonhomologous synapsis. The nonhomologous synapsis of hetero- 
chromatin in the salivary chromosomes of Drosophila is much more pro- 
nounced than in maize; since in Drosophila all the heterochromatin fuses 
into a single chromocenter suggesting that most of the heterochromatin is 
in the form of short single helices. 

The asynaptic and the sticky gene in maize studied by Beadle (1932) 
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disturb normal pairing at meiosis. In a general summary of maize genetics}} 
Rhoades & McClintock (1935) state that the asynaptic gene “disturbs inj} 
some way the normal pairing of chromosomes in early prophase so that} 
most of the chromosomes are present as univalents at diakinesis andj} 
metaphase. The sticky gene (st), as its name implies, causes the chromo-}} 
somes to stick together so that at metaphase I there is a clumped mass off} 
chromatin rather than ten independent bivalent pairs. The sticky gen 
apparently increases frequency of non-disjunction, gene mutation and#} 
produces translocations, deficiencies and chromosome fragments.” 

It may be assumed that the asynaptic gene reduces the number of single} 
helices which appear at meiosis. The sticky gene may cause the entire J) 
chromosome to be covered with exposed single helices at meiosis. One ff} 
may infer that the conditions that lead to the exposure of coded DNA 
along the side of the chromosomes in a single narrow band at meiosis and 
not in mitosis are the result of some balanced physiological mechanism. | 
Rhoades (1950) has described meiosis in maize (Fig. 3) and pointed out 
that the centromeres at meiosis often synapse nonhomologously. 

“The centromeres of non-homologous chromosomes are often observed 
paired or stuck together at pachynema (Fig. 4) although they become free if} 
at late diplonema. Likewise the heterochromatic knobs on different | 
chromosomes have a striking tendency to unite at pachynema. Presumably} 
both the pairing of centromeres and knobs is due to the non-specificity of 
these regions, i.e. the centromere regions of the various chromosomes are of 
similar composition, as are the knobs (heterochromatin).”’ This fact may ff 
be the basis for the phenomenon of reverse linkage observed in Saccharo- 
myces (Shult & Lindegren, 1956) or of the phenomenon of affinity ob- 
served in mice (Wallace, 1958). 


Conclusions 


It is inferred that the coding of DNA ensures precise pairing of homo- | 
logous regions at meiosis and that the double helix ensures precise duplica- 
tion of the coded structure and that this is the biological function of DNA. 

If coding of nucleotides were involved in gene action, one would expect 
the chromatin of the B-chromosome (which is presumed to be coded 
because it undergoes homologous synapsis) to produce enzyme, or to J 
stimulate gene actions which would interfere with the gene action of the. 
standard genome. But neither B-chromosomes nor heterochromatin 
“interferes” with the action of the standard genome suggesting that neither | 
the disordered (short chain?) chromatin of the heterochromatin nor the 
regularly arranged gene-free nucleotides of the B-chromosome has any | 
effect on the genes carried in the euchromatin of the standard chromo- 
somes and that these materials have survival value in producing an 
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abundant reserve of short chain nucleotides which ensures that the 
euchromatin can always be maintained intact. 

The phenomenon of position effect suggests that the genes (in those 
organisms that show this phenomenon) function most effectively when they 
are located in specific places on the chromosome. The effectiveness of 
DNA in transformation, transduction and the peculiar type of partial 
hybridization, described by Wollman, Jacob & Hayes (1956), and Jacob 
& Wollman (1958), are explicable if one assumes that the coded DNA of 
the donor’s transforming principle synapses with similarly coded DNA 
of the recipient chromosome, inserting the chromosomal segment into a 
functional position in the recipient chromosome. Treatment of transform- 
ing principle with DNase would destroy the coded carrier and make the 
transforming principle ineffective. 

On this theory, reciprocal crossing-over would not rearrange the 
nucleotides. (Unequal crossing-over would make the chromosome longer.) 
If crossing-over occurred at Ca bands between the gene-carrying segments 
of DNA, as proposed by Steffensen (1953; 1955), the genes would be re- 
assorted by crossing-over, but the aperiodic arrangement of the nucleo- 
tides in a species would be permanent and species-specific. 
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Tracer Dynamics: | 
I. A Tentative Approach and Definition of Fundamental | 
Concepts 
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The tracer system is discussed in terms of two disjoint sets of atoms: the 
set of tracer atoms and the set of naturally occurring atoms—the mother 
substance. The two kinds of atoms have the same atomic number and 
differ only in respect of the atomic weight, i.e. different isotopes of one 
and the same element. In terms of elementary set-theoretical concepts 
two postulates are formulated which together define a stochastic system. 
As a special case the “‘finite” tracer system is defined as a finite Markov 
chain, and it is shown that this approach gives results equivalent to those 
usually derived from deterministic models. 

The methods applied make possible a rigorous definition of concepts 
such as a general compartment and a compartment with ‘‘complete 
mixing” (here called a ‘‘subsystem’’). Special attention is devoted to the 
concept of turnover, which is given a rigorous definition applicable to a 
large class of compartments. 


1. Introduction 


If we have a system in which a certain substance, in the following referred | 
to as the mother substance, takes part in different processes such as diffusions | 
and chemical reactions, we often want information about these processes. 
It seems then natural to assume that one could obtain such information by 
observing the behavior of another, easily observable substance, which is 
introduced in small amounts into the system, and the kinetic behavior of | 
which can be regarded as equivalent to that of the mother substance. . 
Such a substance is called a tracer. The analysis of such observational 
data is performed under the assumption that the amount of tracer intro- 
duced is so small that the “natural state” of the system is not disturbed. | 

Several papers discuss the theory of such methods (for instance, | 
Solomon, 1949; Sheppard & Householder, 1951; Rescigno, 1954, 1956 
and 1960; Berman & Schoenfeld, 1956; Hart, 1955, 1957, 1958 and 1960), 
but hitherto the main interest seems to have been focused on the purely | 
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mathematical problems connected with the analysis of the tracer data.t 
However, such treatments must always be based on certain assumptions or 
postulates in order to render a formal analysis possible. One of the pur- 
poses of the present investigation, of which this paper is the first of a series, 
is to give a rigorous formulation of those postulates which seem necessary 
for the theoretical approach. Now, we may use postulates mainly in two 
ways: 

1. We have a mathematical model which describes the observed 
behavior of a physical system and we postulate certain physical signifi- 
cances of the parameters that appear in that model, an approach which 
is used, for instance, in wave mechanics. 

2. We postulate certain properties of the physical system and deduce 
from them the “expected” behavior of the system, e.g. as in statistical 
mechanics. 

It is this last type of approach which will be used here and we may 
regard the postulates as a definition of a special system constituting the 
physical background of our formal treatment. 

Postulates, when used in this way, must be reasonable from a physical 
point of view and must permit a formal analysis. Evidently the choice of 
such postulates is, within certain limits, a matter of taste. Thus, the 
stochastic approach formulated in the next section will later be used in a 
deterministic form, and it may therefore be asked if it would not have 
been more justified to work throughout from a deterministic point of view. 
However, the stochastic model seems more satisfactory, especially since 
we shall try to treat a somewhat more general system than the one usually 
considered. Further, the stochastic model has another advantage, of great 
importance at least from a theoretical point of view, which is that it takes 
into account the possibilities of random fluctuations in the behavior of the 
system. Even if at present such fluctuations do not seem to be of much 
practical interest, with experimental improvements they may one day 
appear important also to the experimentalist. 

In this investigation we shall try to look at the system as a physical one 
in its own right and not so much as a purely mathematical problem. An 
attempt is made to put together some new results with those hitherto 
scattered in literature. On the other hand, as the treatments are formal 
and somewhat abstract, the results may sometimes look quite trivial from 
a purely experimental-physical point of view. However, there are hardly 
any definitions, assumptions or statements which are so trivial that they 
are not worth a rigorous formulation and, if possible, also a formal proof. 


+ A critical paper on a common method in this field has been published by Bergner 
(1959), and an error in that paper has given rise to certain discussions (Berman & 


Schoenfeld, 1960, and Bergner, 1960). 
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Often this will simplify the view and the understanding of the problems,| 
experimental or theoretical, and it is also from this point of view that the 
present work has been planned. 


2. List of Symbols 


(The numbers within brackets indicate the page where the symbol 1 
defined.) 


u = an individual atom with given fixed atomic number [123] 
S = the general tracer system [123] 
(s) = the finite tracer system [128] 
B, = the set of all uw in S [123] 
B° = the set of all naturally occurring isotopes of u in S = the} 
mother substance [123] 
B, = the set of tracer isotopes of u in S, not contained in BY [124]] 
w = the non-geometrical state variable [125] 
7 = the geometrical state, i.e. the position vector in the three | 
dimensional Euclidian space [124] 
g = (a, 7) = the position vector in Q [125] | 
O = the four-dimensional state space of uweB, [125] i 
e = the environments of S, i.e. e = QO — S [125] | 
S and e = the corresponding sets of points in O [125] 
o(q, t) = the content density of tracer in the point ¢ at time ¢ [126] 
0°(q, t) = the content density of mother substance in the point g at |] 
time ¢ [126] 
P(t) = the probability of weB; at time ¢ [128], or of any ueB,,) at |] 
t = o being in (ss), at time ¢ [131] 
C, = the compartment v of S [125] 
B= By ..Cj. = BWR [126] 
b,(t) = the content of B, at time ¢, in units of gram atoms [126] 
b(t) = the content of B) at time f¢, in units of gram atoms [126] 
FAG, t) and 
f°(q, t) = the relative content densities of B, and B° [126] 
(ss); = the subsystem 7 of (s) [128] | 
Greek letters v and p are used to indicate compartments | 
while latin letters 7 and j indicate subsystems | 
P,(4t) = the transition probability for the transition (ss), > (ss); 
during time Jt: (s) is assumed to be stationary [128] 
r = the number of (ss) of which (s) consists [128] 
X;; = the probability factor for the flux (ss); > (ss),, and equal to 


d 
AG Pa at) [129] 
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¢,,(t) = the macroscopic flux (ss); > (ss); of tracer at time t [130] 
Ci(t) = the macroscopic flux (ss); > (ss); of mother substance at 
time ¢ [130] 
= QO —C,, ie. the complement of C, relative to O [136] 
= S — C,, 1.e. the complement of C, relative to S [136] 
B, = B; - C, [136] 
A(q, v) = the turnover coefficient of B, [136] 
,(t) = the tracer turnover function of B, [136] 
o,(t) = the tracer turnover function of B, [136] 
A; = b;/b;, the specific activity in (ss); [130] 
oy(t) = the turnover function of B, [137] 
\? = the turnover factor of B, [137] 
A, = the tracer turnover factor of B, [139] 
h’, = the turnover of B, [137] 
A dot over a symbol indicates differentiation with respect 
to ¢t [129] 
A = the square matrix of order r x r of the elements A,; [131] 


Q 
C 


b,(t) 
b(t) = the column vector {| [132] 
b,(t) 
IN = Avogadro’s number [130] 
C,(t) = the flux of tracer from e into C, [137] 


3. The Postulates 


In the following the term tracer system S will mean any kind of physical 
system on which tracer kinetic experiments can be performed; that is, the 
physical object of our research, such as a rabbit, or more exactly, those 
parts of the rabbit where the tracer will be found after being given to the 
animal. Therefore S does not necessarily contain any tracer. 

Let u stand for an individual atom with fixed atomic number and let 
B, stand for the set of all such u in S. Thus weB, (u belonging to Bs) means 
that uw is somewhere in S and has a given atomic number. We may now 
consider different isotopes J of u, so that B, can be regarded as a union 
of subsets 


B, = 1 Fete teehee pene 2! (1) 


where B2 = the set of naturally occurring isotopes of ueBs, the mother 
ubstance, while Bf |;~1,2, _ =the sets of the different isotopes of 
1B, which do not belong to BY and are given as tracer to S. In the follow- 
ng we shall limit ourselves to the case where only one kind of isotope is 
ised as tracer and we can therefore drop the index J and write 


T.B, 9 
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B, = B,U Bs (24 


where By = the set of that u-isotope <B, which is used as tracer for the 
mother substance B’. Evidently By and Bg are disjoint, or | 


BsN Bs = $ (3 


where ¢ stands for the empty set, i.e. Bs and BY have no elements in com-}} 
mon. In other words, B, U Bg is the set of all uw belonging to B, and/o1 
BY, ie. the set of ue(Bs and/or Bg), while B;N By is the intersection o4 
B, and B& and equal to the set of ue(By and Bg), but as there are no such} 
u this set is empty. 
The concept of set is here used in a purely physical sense. Therefore, 
if a set is empty (i.e. = ¢) then it means that an element belonging to that 
set is a physical impossibility. Now, if the atomic number of u is 53 then}f 
Bg is the set of all 1?’J in S, and if the tracer is chosen to be °°J ther} 
Bg is the set of all 12°J in S, even before a tracer is given to S. Thus, a set} 
may be a purely conceptual thing and it is non-empty when it has 
physical sense; we may speak of the set of 19°J in S even if there are no suc 
isotopes in S at present, in which case we say that the content of By i 
zero. The content of a set is here defined as the number of elements#} 
belonging to it. It is also true that the content of ¢ is always zero, a zerolf) 
content of a particular set not necessarily implying that this set is empty 
it makes sense to think of the elements also when they are not present. 
Now, weB, can be found in different geometrical positions 7, where 7 is} 
the geometrical position vector in the three-dimensional Euclidian space,]| 
and we shall refer to 7 as the geometrical state of u. However, the state off} 
u is of course not completely described by 7, since at a certain point 
there will often, during a finite time interval, appear several ueB, i 
different physical states, e.g. « may be a constituent of different chemica 
components the molecules of which are moving around in S. As a some-l} 
what more specific example we may take the potassium ions in blood, 
which can be found in the plasma or in the red blood cells, and within the], 
red cells there are probably different macro-molecules to which the] 
potassium ions can be bound. All such states in which wu can be found, andi 
which do not refer to any special geometrical position of u, will here be | 
called non-geometrical states. As an additional example of this kind of state: 
we have the energy content of u. | 
When we speak of a complete description of u we shall here mean all the} 
geometrical and non- geometrical coordinates which are necessary for | 
“explaining” the macroscopic behavior of wu. This is indeed a somewhat } 
vague concept, but it is not catastrophic since we usually have some] 
intuitive understanding of what we mean by the “‘state” of an atom— 
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see the examples given above. So, instead of analysing this in itself very 
interesting and fundamental concept, we shall, with e standing for the 
environments or surroundings of S, simply accept the following: 

Postulate 1. ‘The set of all non-geometrical states, which together with 
the geometrical states are sufficient for a complete description of the states 
of weB,, is finite and uniquely defined by e, S and the atomic number of u. 

In view of the fact that we always observe S with a finite accuracy, i.e. 
since there are always quantities which are too small to be observable, it 
seems natural to assume that a finite number of non-geometrical states is 
sufficient for a complete description of u when the geometrical states also 
are given. In some respects this set of sufficient non-geometrical states 
evidently depends on the experimental procedure, but as it is desirable 
to have the complete system defined independently of how the observa- 
tions are performed, we have here postulated that the set is uniquely 
given by e, S and the atomic number of u. This now means that we can 
define a variable w which only takes positive integer values 1, 2,3, ..., 
W < © corresponding to the different non-geometrical states. 

Thus, according to postulate 1, the state of a weB, is completely given by 
the point (7, w) = g in the four-dimensional space 0. We may regard QO 
as a general state space which contains all the sufficient states of ueB,, and 
it is given when e, S and the atomic number of uw are given. We can then 
look upon Sas a set of points in QO, the complement of which relative to Q is 


O—S=e 


and we shall use the symbols S and ¢ also for the corresponding sets. When 
necessary we shall explain whether S and e stand for the respective physical 
concepts or just for the sets of points in Q. In order to avoid any mis- 
understandings it should perhaps be pointed out that when we speak of all 
the sufficient states of « we understand all the states which are of sig- 
nificance with respect to the behavior of uweB,. For instance, if weB, can 
leave S for e or if during the experiment u-atoms are moving from e into S 
(e.g. by injections), then certainly the states gee are of importance and must 
be included in the set of sufficient states. It should be observed that while 
S constitutes a “region” in the Euclidian space this is not always so in the 
Q-space, i.e. it is not always the case that two points ¢, and g, belonging to 
Scan be joined by a continuous curve in SS. So, if we speak of ueB, moving 
along certain lines in Q, these lines of transitions may indeed be dis- 
continuous. 

By a compartment C, of S we understand a set of points in Q such that 
C,c S. To each C, there corresponds a subset of B, containing those 
ueBy which are in the states geC,, and we shall for such a set use the 


notation 
9—2 
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B, =a B, ‘ Cy 
a B, = By °C, BY = BS °C, 
In a preceding paragraph we touched upon the concept of content. We | 


shall now define it explicitly in terms of the content densities o(@, t) and’ 
o°(g, t) such that 


(4) 


b(t) = | o(g, #) dg, B2(2) = | 0°, t) dg (5) 

Cc, Cy 
where 6,(t) and 6°(t) are the number at time t of weB, and ueB) respec- 
tively, expressed in units of gram-atoms. In the integrals dg refers to the 
Q-space, and if dv refers to the ordinary volume of the three-dimensional | 
Euclidian space we have | 


b()= > [ o(ws, #, t) de 


wieC, v, 


| 
where V, = the Euclidian region of C, and where the summation is taken |} 


over all w; belonging to C,. It is in this way the integrals (5) should be read. | 
From (2) and (3) it then follows that the total content of By is 


b(t) + B8(t) = | {o(g, #) + 0%, O)} dg (6) 


Ss 


iy 


We shall also introduce the relative content densities of B, and B®} 
defined by 


fat = EO, 190) = SEY ©} | 


Since from (5) we have | 
[6G dg = [7G ag=x (ay Ih 
G, G, | 


these two functions define the distribution of the tracer and its mother } 
substance in C,,. i 
In the following we shall always assume that 
o(G,t)| _ <oMG, 1) 


geS 
t2>0 


GeS (9) | 


t>0 


and, throughout, all discussions of S will be performed under the assump- 1} 

tion of (9) and also that t > o. Now, if we regard a ueB, in a certain state gj | 

at time ¢ (which is thus greater than or equal to zero), we may speak of the | 

probability that this particular w during time At shall move to the state } 

Jy i.e. be in g, at t + At. | 
| 
| 


| . 
| 
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Postulate z. The probability of an arbitrary transition g; — gi in O 
during At at time ¢ is the same for all weB, independent of their prior 
histories and also independent of b,(¢) and the distribution of tracer in QO. 

These transition probabilities will be referred to as the dynamic proper- 
ties of u. By the dynamic state of S with respect to u, i.e. to a certain atomic 
number, we understand the set of all such probabilities and, when they 
are all independent of t, S is said to be stationary. Thus, what we have 
postulated is that the dynamic properties of u when in a state g are inde- 
pendent of how it appeared in that state. This may be expressed as: on a 
macroscopic level the “memory” of u can be neglected. We consider only 
the macroscopic behavior of S, and, in the same way as postulate 1, postu- 
late 2 must in the first instance be looked upon as an approximation. This 
is also true for that part of postulate 2 which states that the different 
u-isotopes have identical dynamic properties, a problem which has been 
treated, for instance, by Bigeleisen (1949). As a general rule one could 
probably say that, except for the hydrogen isotopes, the isotopic effect 
could be neglected, especially when the system is close to equilibrium. 

Another feature of postulate 2, which shows the approximate character 
of that postulate, is the assumption (g). As (g) is an inequality we have, 
from an experimental point of view, no possibilities of making a quantita- 
tive estimate of the error introduced by the use of postulate 2. However, 
we have formulated two postulates in order to make possible a formal dis- 
cussion of certain problems and they are in this respect exactly valid, with- 
out any approximations. Thus we have the fundamental properties of S 


_ independent of the experimental procedure, and so the logical background 
_ for the following discussions. But, as pointed out in the introduction, the 


form of an approach is to some extent a matter of taste. Thus, for instance, 
it may be argued that the state of wu is too vaguely introduced and that it 


_ would have been desirable to have this concept more rigorously defined. 
- However, it is always necessary to begin somewhere using concepts that 


cannot be defined in any other manner than by referring to intuition: 
in Euclidian geometry the concepts “point” and “‘straight line” are intro- 
duced in this way. The same is true for the “particle” in classical 
mechanics, and, in the present theory, the two concepts “‘complete des- 
cription of state” and “‘transition probability” are used in an analogous 


manner. 


4. The Finite Tracer System 


The approach of the preceding section may seem somewhat abstract 
and we shall try in this section to simplify the picture. ‘This is done not 
only for visual purposes but also in order to simplify the mathematical 
treatment. Thus, we shall here concern ourselves with a special type of 
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tracer system, namely that in which the set S in Q contains a finite number |} 
of points only. We shall use the symbol (s) for such a system, where the } 
number of states of weB,) is finite. We may think of the general S as | 
divided into a finite but possibly large number of small “volume ele- | 
ments” (ss);,7=1, 2, .. + 7, so Called subsystems, within which the 
g-points are regarded as corresponding to one and the same state of u. 
This is certainly an approximation and we shall accept it in the same 
manner as one often divides, in elementary calculus, the integral into a | 
finite number of small rectangles. Thus, to a given S it seems reasonable 
to assume that we can always find an (s) the behavior of which, to any 
desired degree of accuracy, agrees with that of S. However, (s) itself is of } 
course defined without approximation, and in that respect our treatment, |] 
as well as the results obtained, are non-approximate. i} 

In the following we shall always assume (s) to be stationary (see page 127). 
Then, if we regard any weB,,) in the state (ss);, the probability of a transi- 
tion to (ss), can, according to postulate 2, be written as P;(4t), which 
thus is a function of 7, 7 and At only. As P,,(4t) is independent of t we jf] 
need not bother about the distribution or the total amount of the mother |} 
substance in (s); either these variables are constant or they do not have jf 
any influence upon the transition probabilities. As « cannot be in two | 
different states at the same time, the subsystems are in this respect | 
mutually exclusive. Thus if, according to (4), B; = By - (ss); = the set of 
ueB,,) in (ss); then, as in (3), 


| 
ia) 


Bin B; = oo) (10) 


i# 


Therefore, if we regard a particular u in (ss); we are in fact faced with a 
special kind of system, namely a so-called Markov chain (see, for instance, |] 
Feller, 1957, and Bartholomay, 1958a), which here is finite and stationary, ]} 
1.e. it contains a finite number of mutually exclusive states and the transition 
probabilities are time-independent. 

Let us regard an individual w’eB,) and let P;(t) = the probability that 
u’ is found in (ss); at time ¢. Then, as all weB,) have the same transition | 
probability P;,(4t) for the transition from @, to g, during the time interval |) 
At, we have | 


Pit + 41) = 3 PKNP,(40 (1) | 
and 


eee P, (At) = 4, (12) } 


TRACER DYNAMICS: I 129 


where 
age (7 7) 
a “ (¢=7) 


We can then write according to (11)f 


I 


P{(2) 


4t-> 


lim ee (e4 At) =P 1) 


r 


tn, {S022 an P= 


I 


hat 
and so from (12) 
P(t) = X POA, (13) 
= 
d 
where A;; = P(t) | , which means that 
Taal. 
P,(4t) = 6, + A, - 4t + o(4t) (14) 


The te a is equal to the probability that w’ is in any (ss) 
whatever at time ¢ + Jt, given that w’ is in (ss), at time ¢. If the set of all 
(ss); Cc (s) represents all the possible states in which w’ can be found, then 


a Daal) == 1 A= atl, 2, , 7, and so uw’ can never leave (s). On the 
Rhee hand if y P;,(4t) < 1 for some j-values, then there are states not 
contained in (s) ner u’ may be found after time 4t, and so we have 


= 1 when (s) is closed 
2 Pia) 2 1 when (s) is not closed (15) 


and as (15) is valid for all At, (14) gives 


Ais S — DA JHU 2.0057 (16) 
133 
where we have equality for all values of 7 if and only if (s) is closed. (16) is 
an important relation to which we shall return in a following paper. 
In order to obtain the physical meaning of our treatment we shall now 
concentrate our attention on (14). As follows from (14) the term 


+ In the following a dot over a symbol is used to indicate differentiation with respect to t. 
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Arjot , where Sf stands for a very short time interval, can be inter- | | 
ij th ee 

preted as the probability that a weB, during time 4¢ shall move to (ss). 

Thus A,,dt represents the probability of a “success” in a conceptual 


a . . 
sequence of Bernoulli trials repeated (n,(t) + nj(t)) times during the | 
time interval [t, ¢ + dt] and where, according to (6), 


n(t) + n9(t) = N (Bt) + (2) = N[foG t) + 0° O}.€F (27) 
(ss)j 

(IN = Avogadro’s number). 
This means that for the expected number of ueB,;, which during 6t move 
from (ss); to (ss),, we get (,(¢) + nj(#) )A;; dt. So if we write ¢,; and 
°, for the macroscopic flux (ss); > (ss); of tracer and mother substance 
respectively, i.e. the number of gram atoms of ueB, and ueB; which in |} 
unit time move from (ss); to (ss);, then 


Cit) = Ay P(t), Ci(t) = Avs BF(2) (18a) 
and in particular for the tracer 
Ci(t) = Aybi(t) = S5(t).4,(2) (18b) 


(Ait) = b,(2)/5;(2) ) 

This is the form of postulate 2 which is most common in the literature 
(see, for instance, Sheppard & Householder, 1951; Berman & Schoenfeld, 
1956; and Robertson, 1957). A; is often known as the specific activity 
in (ss);. 

There are several interesting features of the approach used here. For 
instance, the discussion of the “reaction order” has been avoided (see 
below). We understand that if the amount of tracer is sufficiently small 
(see (g) ) then (18) is valid independently of the mechanism of the flux. 
It is also important to observe that A; is equal to b,/b? and not equal to 
Bales + 6;). Thus Sheppard & Householder (1951) assume as a principle 
that 

Cele, o5 (19) 

ij ij Ee y 9 

From an intuitive point of view (19) seems evident and it implies (18b). 
However, if instead of using postulate 2 we had begun with (19) then 
?; would have been equal to the flux of mother substance when the 
amount of tracer in (ss), is equal to b,, while from postulate 2 we have that 
#j is independent of b,, and thus equal to the flux of mother substance 
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also when there is no tracer present in (s). But, to arrive at these conclu- 
sions from postulate 2 it has been necessaty to assume (s) stationary. 
This is equivalent to the assumption that ¢?,/b° is constant, an assumption 
which seems to be evident only after the interpretation of (%/b° as a 
probabilistic quantity. On the other hand, as this is a quite natural inter- 
pretation, we understand that on using (19) instead of postulate 2 we would 
probably have arrived at the same results. But somewhere on the way a 
basic assumption or postulate is necessary, and it seems then more satis- 
factory to begin with the postulate than to formulate it second-hand so 
to speak. 

Also, the pronounced stochastic character of the approach is of some 
special interest. As (13) is valid for all 7-values and all weB,,) we have in 


matrix notation 


P(t) = 2° P(t) (20) 
: P,(t) 
where P(t) is a column vector | | and A is a square matrix of order 
xr P(t) 
Ay Aw. ++ Ay 
Agr Ago.» » Ag, 
Fo Ava om Ape 


Let us now regard the “population” of uweB,, i.e. the population of 
tracer atoms in (s). From (20) it then follows that if we know P(t) for 
t = 0, say, then P(t) is known for any value of t > o (see the assumption 
for t in connection with (g) ). Thus we have a distribution of probabilities, 
where each probability P,(¢) tells us the probability for any ueB,) at t = 0 
to be in (ss); at time ¢. To each weB,,) and each (ss), there corresponds a 
binomial distribution with P,(t) as the probability of a “‘success”’, Le. 
ueB, at time t, and (1 — P,(t)) as the probability of a “failure”, i.e. 
ue B, at time t. According to postulate 2 the probability for a transition of a 
particular weB,,) is independent of what the other w’s are doing, and there- 
fore all weB,,) are statistically independent of each other. On considering 
the binominal distribution above, this then means that each uweB,,) can be 
regarded as an independent “‘trial’”, analogous to the discussion of the 
fluxes (page 130). Hence the expected number of “successes”, i.e. the 
expected number of weB,, is given according to (17) by 


N 5,(t) = Nb (0) * P(t) (21) 
Ge(o) = > (0) 
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And so (20) impliest 


b(t) = a Bt) (22)} 


b(t) 
where b(t) = ( 
b,(t) 


ueB |.) in respective (ss) expressed in gram-atoms. 

The resemblance between (20) and (22) is only formal: P(t) is a pur 
deterministic variable while b(t) is a stochastic one. Thus, there are nol 
fluctuations associated with P,(t) while ,(t) is a mean value with corre-} 
sponding standard deviation 


Dit) = VN bg (0)P(DG — P(e) 


which means that the standard deviation relative to the mean value is 


Dit) _ V ba(0) POG = PAO) _ V be(0) | 
Noi) b(t) /N * B/N || 


This is a kind of a general \/N-rule of nature, which implies that on a 
macroscopic level such fluctuations can usually be neglected (see for} | 
instance, Bartholomay, 1958b, and Schrédinger, 1952). |) 

Analogous arguments hold for the flux ¢;,(¢) too, when ¢;(t) is regarded i] 
as a conditioned stochastic variable, i.e. given that the content of B, ati) 
time ¢ is equal to 5,(¢) then the mean value of the flux of tracer from (ss); iJ) 
to (ss); is equal to ¢;,(t) as given by (18), and the relative standard devia- i] 
tion is of the order 1/V Nb,(t). NowA, | i; 2 0 and so (16) implies that |} 
the diagonal elements of A are all nonpositive and, as is evident from (14), |} 
|A,; ° 6¢ | = the probability of a ueB; to leave (ss); during time 8¢. |} 
Thus, ¢;(t) < o and | ¢,(¢) | = the expected flux of tracer from (ss),. 
Hence if there is no flux from e into (ss),, 


Wi) = & (0 (24) 


Regarding all ¢;(¢) as independent stochastic variables, we understand |) 
that the standard deviation of b,(t) may be considerable, especially for |) 
large r-values, even if the standard deviation of each C(t) 1 is small com- | 
pared to ¢,(t) itself. However, b,(¢) is a macroscopic variable usually |} 
determined by observations of 6,(¢) at different ¢-values separated by 
finite intervals. ‘Therefore, if we cannot observe any fluctuations in b,(t) |] 


| is the column matrix of the expected number o 


t Observe that (20) is valid only for ueBis) at t = 0, and it does not take into account | 
any flux from e into (s); the same restriction is valid also for (22). . 
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then this will not be possible for 6,(t) either.t Thus, in light of the dis- 
cussion above we shall accept (18) and (22) (and then also (24) ) without 
considering their statistical nature. Hence A,; is given a statistical signifi- 
cance but the corresponding relations (18) and (22) are looked upon as 


being deterministic or phenomenological ones. But this is in part an 


approximation, and, as indicated in the introduction, due to experimental 
improvements it may some day be necessary to leave this deterministic 


_ view and take into consideration the statistical nature of the system. At 
_ present, however, distinguishing the fluctuations implied by the experi- 


mental procedure from those which are implied by the observed system 
itself seems to be an unsolved problem. 

The (ss)-concept has no necessarily evident physical significance and 
we have introduced it mainly for simplifying the formal treatment. But 
at the same time there are several simple examples where the physical 
meaning of an (ss) is quite obvious. For instance, the simple system 
shown by Fig. 1 can be regarded as an (s) consisting of two (ss) only, or, 


RiGuets 


_ in common terminology, two compartments with complete nuixing. ‘Thus we 


have (ss), and (ss), containing water solutions of substances which can 
pass through the membrane M. With good stirring assumed, the proba- 
bility that a certain particle in a given (ss) will hit M during time é¢ can 
be regarded as independent of the geometrical position of the particle 
within that (ss). Even if this statement seems to contain some non-trivial 
problems we accept it as evident from an intuitive point of view. Thus the 
two compartments contain only one (ss) each, and we have from (18) 


Forlt) = Sor(t) — Cro(t) = 621 (2)Aa(t) — £42(#)Ad(t) 
Farlt) = F2i(tAr(t) — S22(t)4Ani(t) (25) 
(4A,,(t) = A,(t) — A,(2) ) 


+ In fact, on a microscopic level the time derivative of the content of B; is equal either 
to zero or to infinity. 


) Og 
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where 7,(t) and 79,(¢) are the net fluxes of tracer and mother substance | 
respectively from (ss), to (ss). This equation is essentially the same as. 
that derived by Nims (1959, 2nd equation (36), page 333), which states | 
that, unless 4A,,(¢) = 0, observation of the net flux of tracer gives no | 
information about the net flux of the mother substance; one exception 1s | 
of course the case where £°, =o, €8, > o or £9, > 0, 63, = 0. However, } 
the thermodynamic approach used by Nimsf assigns no significance to 
the coefficient of 4A,,(¢). Whether the form of (25) is correct or not could ] 
probably be tested experimentally, but hitherto remarkably little basic | 
experimental work in tracer dynamics has been published. | 
Finally, one more thing requires brief discussion, namely the physical. 
significance of (9). Let us take, as a simple but illustrative example, the, 
case when the flux (ss); (ss): can be regarded as a reaction of order 
2 0)1.€. 
Cii(t) = Sut) + GA) = Ry(Bj(O) )" | 
where 0;(t) = b,(t) + b9(t). This equation should be regarded as a | 
phenomenological one, and therefore no microscopic interpretation is |} 


required. 
If from the beginning 5; = 0, then, for fixed f¢, 


Ci = oF | b= 0 a k,(b;)" 
and thus 
rs) i — nk,(b;)"—? 5b; -- 0(805) 


Let now a small amount 4, of tracer be added to (ss); such that b; <6), | 
hence 


8055 = 1 Ri(Bj)"* 8, 
The total flux of ueB; is now 
Ci = Sh |opao + 885 = SF + Sy 
= Ce or = dC}, + 8; 


where we evidently have 


iF * n- 
by = 8b = n Boi = Rii(bp)"~ * b, 
and 
I * 


ty = —— 80 = (2 — 1) (Bb 


ti Nims works from the phenomenological equations of irreversible thermodynamics | 
(see for instance Denbigh, 1951), and he assumes certain forms of the thermodynamic | 
forces and also some special divisions of the net fluxes. 
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» But, as b; <b, 
= [ayaa + 8h OG [ayo H Byles? 
This is straightforward and on comparing with (18) we get 
A; = k,(5)" 7, n >o 


_ However, in biological systems zero order reactions are not unlikely to 

occur, €.g. in enzymatic reactions where, according to the model of 

_ Michaelis and Menten, all the enzyme molecules are bound to the sub- 
strate. In such a case we have 


or Cy + oy = ky 
Now &;; is proportional to the number of weB; in the substrate-enzyme 
complex, thus the probability that any weB; will be in this state is propor- 
tional to k;,/(6; + b?) where thus a we(substrate-enzyme complex) is 
considered as belonging to B;. Repeating the arguments used in connec- 
/ tion with (18) we get 
= Fy BS Ei Ry rae Ry b. 
>t. is , Lie Op aay. 0) 
ng a kes ie Wa a ene 
as b; <b’. Instead of speaking of probabilities we could have used (19), 
which would immediately give the same results. Whichever way we choose, 
we have as an elementary conclusion: 
Theorem 1. If the flux of weB,,.) from (ss); to (ss); can be regarded as a 
reaction of order n > o, then (18) is valid and 


Ag = Fi(B5y~* (26) 


where k;; is the ordinary phenomenological rate constant. 

Evidently, (26) is a more or less trivial consequence of (18), and, from 
the point of view of reaction kinetics, merely another way of writing 
postulate 2. However, there seems to be some confusion concerning the 
influence of the order of reaction on the tracer dynamic equations, not in 
the literature but at the laboratory. Further, the treatment above 
gives a good illustration of the approximative character of (18) and the 
necessity of (9): (9) is necessary except when m = 1, in case of which (18) 
is exactly valid. But, also if (9) is not necessary when = 1 we must 
remember that k,; often is sensitive to variables such as ionic strength 
and pH, variables that may be influenced by the addition of tracer. Thus, 
even when n = 1 the amount of tracer must be kept as small as possible. 
The necessity of this condition is also evident from another point of view: 
at high concentrations there will be an interaction between the tracer 
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particles. The importance of the mother substance may be, among others, } 
that to a certain extent it prevents such interactions. 


5. The Concept of Turnover 


Let C, be a compartment as defined on page 125, and let dX(q’, 7)dt 
be equal to the probability of a transition during time 6¢ from g to 7’. 
If O' =Q—C,, ie. the complement of C, relative the Q-space, we 
define for geC, 


MG ») = | a’G, 9) (27a) |} 

Q", | 
where the integration is performed with respect to q’. Since all the transi- |} 
tions g — q’ are mutually exclusive for different g’, A(g, v)5t is equal to the } 
probability that a u in the state geC, during time 5¢ shall move to one of } 
the states belonging to Q’. We call X(g, v) the turnover coefficient of |} 
B, = B, - C, (see page 126) and define the tracer turnover function of B, as jj 


pt) = | MG MFG 1) ag (28) | 


Cy 


where f,(g, t) is the relative content density as defined by (7). The sig- 
nificance of ¢,(t) is evident: —,(t)dt is the expected probability at time 1 | 
of a ueB, to leave C, during a short time interval 5¢. It should bef} 
remembered that we assume uw given with a fixed atomic number, and] 
that in this respect all our treatment is hitherto and throughout concerned }/ 
with one and the same u. Thus, if we are dealing with u’s having different | 
atomic numbers, then it would be necessary to indicate in the symbols |} 
which atomic number they refer to. 
Now, to each C, there exists one and only one Q and one and only one }} 


GC, =)S=iG> 
and we define the turnover coefficient of B, = B,- C’, as 
AG’, ») = | AG 7) (27b) | 
0’, 


and in terms of that the tracer turnover function of B, 


, = =, mp, 
p(t) = | AG, “AAG, t) dg (29) ] 
Cl, | 

t This means that we assume: if P(q’, , At, t) is the transition probability in postulate 2 ] 
corresponding to the two points g and @’ then o"P(q’, g, At, t)/(0At)” exists for || 


{2 


At=0 


nm =0,1,2,...and for any g and q’ belonging to Q. 
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Here ¢,(t)3t = the expected probability at time t of a ueB, to move into 
C,, during a short time interval 8¢. 

We thus regard O as divided into the two sets of points S and e, after 
which we have selected an arbitrary compartment C, of S, i.e. C, CS. 
Corresponding to this C, there are two complements: Q, with respect to 
O and C;, with respect to S. Further, in terms of the coefficients of turn- 
over A(q, v) and X(g’, v) we define the tracer turnover functions Y, (t) and 

g,(t) such that, when b,(¢) and /(t) are the contents of B, and B’, respec- 
i ively at time ¢t, the expected fluxes of tracer from C, and from C; into 
C, are given by ¢,(t)b,(t) and ¢,(t)b,(t) respectively. Hence, if ¢,,(t) = the 
flux of tracer from e to C,, we have the ordinary continuity equation of 
the form 


b(t) = ¢(t)b,(t) — 9,(2)b,(t) + £,0(2) (30) 


where as usual we regard the fluxes not as stochastic but as deterministic 
variables. 

Of interest is the tracer turnover function 9,(t). Now, according to 
_ postulate 2 the tracer and the mother substance have the same turnover 
coefficient A(g, v), hence the turnover function of B, is 


95(t) = | AG 2G, 2) ag (31) 


Let us make the following definition: S is said to be perfect if it is not only 
stationary (which here implies that AG, v) ie = =e v) = 0) 


G’eCy’ 
cycs 


geCy Cs 


but also if 6°(@, t) =o. Thus, if S is perfect then f9(q, t) 
0 


tm 


geS 


= 7(g) and we get from (31) 


qeCy CS 


g(t) = [AG v)n9(Q) dg = ® (32) 


Hence, when S is perfect ¢%(4) |, 9 = A’ and we shall call this parameter 
the turnover factor of B,, which thus refers to the mother substance of B,. 
_ This makes it possible for us to define the turnover of B, as 


hy = A, ° (33) 
_ which is the flux of mother substance from C,,. 


When 5S is perfect the fluxes of the mother substance are all constant 
with respect to t, and only then can we speak of a turnover. However 
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common the concept of turnover may be, it is very seldom rigorously | 
defined and in the literature different opinions concerning its meaning 
can be found. For instance, Bray & White (1957) define the turnover in 
terms of “the amount of substance removed in unit time’, i.e. the flux 
out of C,, while Robertson (1957) seems to be of the opinion that one 
should use two turnovers: the flux into C, and the flux out, in case they 
are different. Now, if the turnover is intended to describe certain proper- 
ties of B®, then it is not justified to regard the influx as the turnover: the 
influx is equal to (¢?’(t)b°’(t) + &9,(t) ) and thus dependent on variables 
or parameters that are characteristics of B)’ and ¢),(t). But as we are 
discussing here a perfect S, this problem is more or less of academic 
interest only, as in such an S the influx from OQ, must be equal to h? defined 
by (33). The assumption that S is perfect may perhaps at a first glance look 
quite restrictive. However, we have assumed that S is stationary, otherwise 
no description of the dynamic state of S in terms of constant parameters 
would be possible (see page 127). Further, from theorem 1 we have that, 
unless we assume all the fluxes to follow first order kinetics, variations in 
b° are likely to influence the turnover coefficient A(g, v), hence, if we | 
actually want to speak of a turnover factor, 5? must in general be constant. 
Also necessary for the existence of A? is that f°(g, 2) = 7°(@) for all geC,, 
and we therefore understand that the assumption that C, is perfect is, 
if not quite necessary, difficult to avoid. As this implies that the influx and 
efflux of the mother substance in C, must balance each other, a moment’s 
consideration will make evident that also C’ must be perfect, otherwise 
theoretically possible but physically unrealistic assumptions concerning 
the properties of BY’ and €°,(¢) are necessary. However, there seem to 
exist systems where a perfect state is never attained, e.g. some endocrine 
systems (Danziger & Elmergreen, 1957), and such systems are not, of 
course, included in our treatment. 1 
We have thus defined a turnover or, perhaps the more rational concept, |} 
a turnover factor. The definition is rigorous, and applicable to a large class |] 
of systems: there are no restrictions on the choice of the compartment C,, |} 
e.g. we may take C, to be the non-protein alanine of the liver, or the total |] 
liver alanine, or the whole set of amino acids in the liver and so on. Now, | | 
if we have chosen C, to be the non-protein alanine in the liver and we } 
regard the case where u = nitrogen, then B® is equal to the set of all 
naturally occurring nitrogen atoms in this compartment. It is then evident | 
that the corresponding h? may be quite different from the one, for instance, 
that corresponds to u = oxygen. Thus, when we concern ourselves with 
a certain metabolite, the molecule of which contains atoms with different | 
atomic numbers, it is in general not possible to speak of only one turnover 
of that metabolite: the turnover must be specified with respect to a special 
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atom or a kind of atoms contained in the molecule of the metabolite 
considered. 

It is important that the turnover concept introduced here refers to an 
atom chosen at random from the whole population of the atoms under 
consideration; thus it can be used rigorously without referring to any 
detailed model such as a ‘“‘precursor map”. A simple but important fact, 
which also makes the significance of the (ss) concept more evident, is that 
according to (8) 


o(t) = [AG DFG, 2) dg = — Au [G2 dg=—Au 


(SS) i (ss) i 


= — deo | RG, 1) dg = He) (34) 
(88) i 
Hence, for any subsystem there exists always a turnover factor and a 
“tracer turnover factor’’ such that A? = Ai = — Aii, where Ai: has the same 
significance as in the preceding section. 
If now lim 7,(g, t) = 7,(g) exists for all geC, and for any C, CS, S is 


t—> 2 


said to be a limiting S. If S is a limiting S then 
lim 9,(t)=A,, lim 9,(t) =a, (35) 
t—>o 


t— 2 
both exist for any C, Cc S, and we shall refer to 4, as the tracer turnover 
factor of B,. On comparing (28) with (31) we immediately have: 
Theorem 2. If S is a perfect and limiting S, the turnover factor A? is equal 
to the tracer turnover factor A, when (1) 7,(9) | geo, = 70(Q) | gc, and/or 


d 
(2) di AG, v) O. 
Condition (2) is an analogue to the “lumping effect”’ first described by 
Sheppard & Householder (1951) and later in a somewhat more general 
manner by Hart (1955) and which in light of (34) can be expressed thus: 
when X(g, v) is constant for all geC,, then SS behaves as if C, is a subsystem. 
However, this theorem faces us with the question of how to perform a 
tracer experiment in such a way that the tracer “represents” the mother 
substance. Together with the problem of the determination of A, from 
experimental data, this question forms the foundation of the fundamental 
problem of tracer kinetics and to this we shall return in a later com- 
munication. 


GC, = 
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1. The properties of the “blue mechanism”’ are described. It is shown that 
they differ in many ways from those of the ‘‘red and green mechanisms’’. 
2. The possible contributions of rods to photopic vision are outlined and 
discussed. 
3. There are strong arguments both for and against the view that rods 
are concerned with the “blue mechanism”’’. 
4. Evidence is put forward for the inhibition of rods by cones. 
5. It is suggested that in photopic conditions the residual activity of the 
scotopic mechanism, i.e. the activity of the rods which escapes inhibition, 
constitutes the basis for the blue mechanism. 
6. Hypothetical luminosity curves for the blue mechanism in protanopes, 
deuteranopes and in normal subjects have been plotted on this basis. 
It is deduced that the blue mechanism could be dependent upon rods 
being inhibited by cones in accordance with rules similar to those which 
Hartline has found to hold for the somewhat similar inhibitory pro- 
cesses which occur in the retina of Limulus. Possible explanations for the 
existence of two blue mechanisms (the 7, and 73, mechanisms) as revealed 
by Stiles are put forward. The hypothesis accounts reasonably well both 
qualitatively and quantitatively for many of the main features of colour 
vision. 
7. Approximate luminosity curves for various retinal locations are 
predicted in terms of the estimated numbers of receptive fields for rods 
and cones in those areas, on the assumptions that luminosities are 
directly additive and that, probably because of the average areas of their 
receptive fields, about 7-5 rods are equivalent to one cone, in so far as 
luminosity is concerned. These curves are directly comparable with 
those actually found. 
8. Various anomalous observational data appear to be explicable in terms 
of the hypothesis which is outlined. 
9. The hypothesis is certainly not without its difficulties; there are at 
least two very strong arguments based on the accepted principles of 
colorimetry which will have to be satisfactorily answered before it can be 
accepted. 
10. On the other hand, if rods play no part in colour vision three equally 
difficult questions must be answered. They are :— 
(a) How is the response from rods eliminated from photopic vision ? 
(b) Why do the hypothetical “‘blue’’ cones have such different 
properties from the “red” and “green’’ cones, if they are them- 
selves cones ? 
I4I 10—2 
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(c) Why is there no direct anatomical or physiological evidence for the 
existence of these “blue” cones, if they are so different in kind ? 
11. The ideas put forward in this paper are intended more to stimulate 
thought on the nature of the blue mechanism, which still remains the 
outstanding problem in the physiology of colour vision, than to constitute 
an exact description of the mechanism 


Introduction 


Human colour vision is now firmly believed to depend upon three separate 
receptor mechanisms. In normal subjects the three can be reduced to two 
by confining vision very strictly to the central fovea (i.e. by directing the 
gaze steadfastly at objects subtending less than 0.5° at the eye). By this 
means, the so-called blue mechanism can be effectively excluded (K6nig, 
1894; Willmer, 1944; Willmer & Wright, 1945). The eye is then dichro- 
matic and tritanopic. In red-green-blind subjects the simplification can be 
carried one stage further by the same technique and single mechanisms 
can then be isolated (Willmer, 1949b). The red-blind subject, or protanope, 
shows only the green mechanism in his central fovea and at least some 
green-blind subjects, or deuteranopes, show only the red mechanism when 
their vision is similarly confined to the foveal centre. Such normally 
dichromatic subjects become monochromatic. 

In 1955 the results of an analysis of central foveal vision were published 
(Willmer, 1955) and it was shown that much of the reduced normal vision 
characteristic of this area (i.e. the dichromacy and tritanopia, could be 
explained on the basis of these two mechanisms, the green (P) and the 
red (D), each of which was thought to depend upon a single carotenoid- 
protein type of pigment. It was shown that the sensations of colour (i.e. 
the saturation and hue) which could be evoked from this area were closely 
related to the differences between the ordinates of the spectral sensitivity 


curves for the two mechanisms (as determined by the data obtained from | 


the foveal centre of colour-blind subjects) when these curves were plotted 
on a logarithmic scale and so linked as to have the same sensitivity at the 


observed neutral, colourless or white point in the spectrum, which was |} 


found to lie at 574 mp (Fig. 1). Since that time, the work of Rushton 
(1957, 1958) on the pigments of the fovea of normal and colour-blind 
subjects has confirmed the belief that the P mechanism (i.e. the green 


mechanism, which is present in isolation in the protanope or red-blind) | 


does indeed depend on a single pigment. His data for the D mechanism 
are, however, not so clear-cut, and it is not yet certain from them whether 
this mechanism depends entirely on a single pigment or not. 


The spectral sensitivities of these two mechanisms, as isolated in the | 


central fovea and represented by the P and D curves of Fig. 1, are closely 
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similar to those which Stiles (1953) has called the 74 and 7; mechanisms, 
on the basis of his work on increment thresholds with two-colour fields. 
By this technique, the eye is adapted to a certain intensity of a coloured 
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Fic. 1. The sensitivity curves of the three known receptors in the human eye. 

R. The rod mechanisms (curve from Flamant & Stiles, 1948) dependent on rhodopsin. 

P. The central foveal luminosity curve for the protanope (Willmer, 1955). The 
receptors are probably dependent on chlorolabe (Rushton, 1958b). The open circles 


represent the 7, mechanism of Stiles (1953). 
D. The central foveal luminosity for the deuteranope (Willmer, 1955). These receptors 
may depend on a mixture of chlorolabe and erythrolabe (Rushton, 1958b). The filled 


circles represent the 7; mechanism of Stiles (1953). 

The P and D curves have been linked at 574 mp, which is the neutral point in the spectrum 
for the dichromatic central fovea; at this point the sensation changes from green to red 
(Willmer, 1955). 

The R and D curves have been linked at 610 mp, since the directional effect of light 
which only affects cones becomes detectable from about this point to the red end of the 


spectrum (Flamant & Stiles, 1948). 


field and a test patch of another colour is superimposed upon it. By 
measuring the luminance of the test field necessary for the superimposed 
stimulus to be perceived, and by varying the intensity of the adapting 
field, it is possible to study the effects of the adapting wavelength on the 
sensitivity of the mechanisms used in the perception of the test patch, and 
thus by varying the wavelengths of the two fields, to determine the 
sensitivity of these mechanisms to the various parts of the spectrum. For 
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example, if vision of a given test colour were to depend upon two mech- | 
anisms, then with no adapting field a threshold could be found for the | 
perception of the test colour. If then an adapting field were turned on this 
would probably depress the sensitivity of one of the mechanisms more 
than the other (depending on the wavelengths used) and so the threshold | 
for the perception of the test object, would be raised, and, as the intensity 
of the adapting light was further increased, it would continue to be raised 
in a regular manner until the adapting field began to affect the sensitivity 
of the second mechanism. There would then be a discontinuity in the | 
intensity-threshold curve. Thus it is possible to find that intensity of the | 
adapting field at any wavelength which will just alter the sensitivity of the }} 
various mechanisms involved in vision at different wavelengths: Stiles has 
shown by this method that there are at least five such mechanisms (7, to 
m5); i.e. there are five mechanisms that can have their thresholds altered 
by adaptation in the manner described. The technique does not indicate 
whether these mechanisms are actual receptors or some other susceptible 
levels in the neural pathways of vision. 

In the earlier paper on central foveal vision it was suggested that, since 
the frequency of nerve impulses in the nerves leading from photoreceptors 
is, in general, much more closely related to the logarithm of the intensity 
of the stimulus than to the intensity directly, the difference between the 
ordinates of the sensitivity curves (plotted on a logarithmic scale), is really 
a measure of the difference between the frequencies of impulses set up in 
the two pathways, and this difference was shown to be related closely to 
the saturation of the colour. Thus, in vision by the foveal centre, when the 
P mechanism initiates a higher impulse-frequency than the D mechanism, |} 
sensations of blue-green are set up; when the D mechanism is the more 
active, various degrees of redness result. The saturation or vividness of the 
colour, in each case, increases with the extent of the excess activity of the |] 
one mechanism over the other. When the two mechanisms are responding | 
equally, a sensation of “no colour” is produced. The saturation of the | 
colour thus depends upon the magnitude of the difference; and hue. 
discrimination, in its simplest form, depends upon the rate oF change of 
this srepntontl: with wavelength. 

Though, at the time, this was a purely hypothetical concept based on. 
human central foveal vision, the recent work of Svaetichin and his colleagues’ 
(MacNichol, Macpherson & Svaetichin, 1957; Svaetichin & MacNichol, |] 
1958) on the retinae of fish has provided experimental evidence for the |} 
existence of a mechanism, in these animals at least, which could be inter- 
preted to be working in exactly this way. They have shown, by inserting 
micro-electrodes into the retinae of certain fish, that there are cells or | 
structures that show an increased negative potential in response to 
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illumination with light from the blue end of the spectrum and a positive 
potential with light from the red end of the spéctrum. Clearly, if other cells 
were influenced by these increased positive or negative potentials, their 
responses would give precisely those required for such a mechanism as 
that postulated for human foveal colour-vision. It is of course true that 
this evidence, which is the only direct evidence, is derived from fish and 
not from man, but, at least, it is now clear that the suggested mechanism 
is certainly possible, at the neurological level, since it actually occurs in 
other animals. 

Furthermore, MacNichol, Macpherson and Svaetichin have shown 
that among the horizontal cells, or closely related structures, there are 
some that, upon illumination, show slow and sustained negative poten- 
tials of an extent which in scotopic conditions varies spectrally in the same 
manner as the sensitivity of rhodopsin. Under photopic conditions, in 
some species at least, the maximal spectral sensitivity for these negative 
potentials shifts towards the red as in the typical Purkinje shift. The 
magnitude of these potentials is, at any one wavelength, rather closely 
related to the logarithm of the intensity of the light. They regard these 
cells as luminosity detectors, and it is interesting that they are distinct 
from those which could be interpreted as colour detectors. 

In the paper on foveal vision (Willmer, 1955) it was also shown that, by 
making certain simple assumptions, the central foveal luminosity curve 
could be derived from the spectral sensitivity curves of the two mechanisms 
(the P and D mechanisms). It was assumed that the light of any wave- 
length affected the two mechanisms in such a way that each produced its 
own characteristic effect on the first synaptic connections leading to the 
bipolar cell concerned with luminosity. The effect produced depended 
partly on the sensitivity of the receptor to each particular wavelength and 
partly on another factor, which was probably the number of receptors of 
each type involved, i.e. on the number of receptors whose responses were 
transmitted to the particular bipolar cell. The effects produced by each 
receptor then added together in stimulating the bipolar cell; the response 
of the bipolar cell, when thus stimulated by the changes in the first synaptic 
layer, was proportional to the logarithm of the added effects. The data 
were such as to suggest that, in the central fovea, the D receptors were 
either much more numerous than the P receptors (P : D = 2 : 3) or that, 
for some other reason, they contributed much more to the common 
synaptic pool than the P receptors. The luminosity curve was found to 
result from the expression :— 


log(xk,p + yk,d) 


where p and d represent the actual intensities of the stimulus at any given 
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wavelength in so far as it affected the P and D receptors respectively: Ry } 
and k, are factors relating the stimulus to the response (they were actualld | 
made venta to 1 since it was assumed that this was likely to be very much | 
the same for both types of cone) and x and y represent the relative numbers | 
of the two types of receptor. Empirically it was found that a good fit with | 
the experimentally determined luminosity curve was obtained when 
M2 and 4 == 3. 

The experimentally observed differences in the shape of the luminosity |} 
curve (Thomson & Wright, 1947) with central fixation, and at 20’ |) 
and 40’ off-centre, were best fitted with values of xk, and yk, in the }} 
following ratios: central, 2 : 3; 20’ off-centre, 2 : 2; and 401} 
off-centre 2 : 1 respectively. These effects could be explained either by, 
differences in relative numbers (x and y) or by variations in the effective- | 
ness of the stimuli on each of the receptors (k, and k,) or both; the first |} 
explanation seems the more probable. All these calculations were based on 
the central foveal measurements only, and they neglect the presence of | 
the “blue” mechanism, though certain aspects of this mechanism are 
certainly beginning to be appreciable at 40’ off-centre. 

In the following pages, vision in the whole eye, i.e. trichromatic vision, 
will be considered on the same lines as those adopted for vision by the 
central fovea and in relation to the spatial distribution of the known|}) 
receptors in the human retina. The nature of the blue mechanism is of |} 
course the main enigma, and some of its properties and peculiarities will 
first be outlined. 


The Properties of the Blue Mechanism 
1. From colour-mixture data (Wright, 1946), the blue mechanism has | 
its maximum sensitivity at a wavelength about 450 my, and contributes 
little or nothing to vision at wavelengths longer than about 540 mp in the} 
normal eye, about 540 mp in deuteranopes, and about 550 mp in protanopes. 
2. From increment-threshold data (see p. 143) in the normal eye, Stiles | 
(1953) has established the existence of two separate mechanisms, 77, and | 
m, (Figs. 2 and 3), both maximally sensitive in the neighbaurhead of | 
450 mp. Of these, 73 could perhaps be initially dependent directly upon a 
single pigment of the rhodopsin group, since its spectral sensitivity curve |} 
is of the right shape (Dartnall, 1957), but the spectral sensitivity of the 7, |] 
mechanism would be difficult to reconcile with such an hypothesis on ac- | 
count of the curious hump between wavelengths 580 and 600 my. If the. 
visual pigment of this mechanism is, as in most other visual mechanisms, a 
carotenoid- -protein, it must be “‘screened” in some way. Alternatively the 
“mechanism” is a “neural” path and not directly receptoral. 
3. The blue mechanism can be eliminated from normal vision by using] 
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only the central fovea. Whether this “elimination” depends on actual absence 
of the mechanism from this area, or on the action of some inhibitory 
mechanism which comes into play with central fixation, is unknown. 


4 


Sensitivity (1/logjig E) 
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Fic. 2. Curves showing the effect of inhibition of the rods by the P cones, as in the 
protanope, in accordance with the type of inhibition which has been shown by Hartline 
and his colleagues (Ratliff, Miller & Hartline, 1958) to occur in the eye of Limulus. 

R. Rod sensitivity curve; P. Sensitivity curve of P cones (see Fig. 1); 7, threshold 
above which the mechanisms produce responses in proportion to the ordinates. 

Curve I. B= R—1-0(C — CC) 

Curve II. B=R-—1-5(C —C)) 

Curve III. B= R — 1-75 (C — C,) 

Where B = ordinates expressing the sensitivity of the blue mechanism, R = ordinates 
of the R curve measured from T, C — Cy) = ordinates of the P curve measured from T¢, 
the point at which the cone response becomes effective as an inhibitory stimulus. 

o——o0, the 7, mechanism of Stiles (1953). 


There is some evidence that very small and stationary images falling else- 
where on the retina also tend to be seen in a somewhat tritanopic manner, 
i.e. without the intervention, or with much less intervention, of the blue 
mechanism (Hartridge, 1947). A completely stabilized image anywhere on 
the retina soon ceases to arouse any sensation of colour (Ditchburn & 
Ginsborg, 1952; Riggs, Ratliff, Cornsweet & Cornsweet, 1953; Ditch- 
burn, 1958). 


4. Apart from this elimination from the central fovea, the blue mech- 
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anism shares with that for yellow the widest distribution over the retina. }} 
The visual fields for blue and yellow are considerably larger than those | 
for red and green, and, when bright enough lights are used, they probably | 
extend over the whole retina. Indeed, the retinal periphery has, under | 
photopic conditions, properties that are closely similar to those associated | 
with dichromacy, and probably with deuteranopia. 


Sensitivity (1/logjg E) 
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Fic. 3. Curves showing the effect of inhibition of the rods by the D cones of the | 
deuteranope. The curves are derived in exactly the same manner as those in Fig. 2. 
o—-—o0, the 73 mechanism of Stiles (1953). 


5. In both deuteranopes (green-blind subjects) and protanopes (red- 
blind subjects) the blue mechanism is at least normally developed, and 
these subjects see all colours in shades of either blue (violet?) or yellow. | 
The spectral colour-mixture curves for protanopes, deuteranopes and | 
normal subjects indicate that the blue mechanism contributes pro- | 
portionally to very different extents in each of these types; nevertheless the | 
sensation of blueness is, in general and like that of yellowness, only very | 
rarely absent or grossly abnormal. This suggests that the mechanisms 
responsible for these sensations depend upon some much more constant | 
and stable features in the eye or visual pathway than do the red and green 
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mechanisms, which themselves seem often to be ‘“‘fused”’ into a common 
mechanism, namely the yellow mechanism itself, 

6. While deuteranopia and protanopia, in which the abnormalities 
certainly involve the cones or their connections, are inherited as sex- 
linked allelomorphs, tritanopia (blue-blindness) is inherited quite separately 
as an autosomal recessive (Kalmus, 1955). The gene for the development 
of the structures responsible for the blue mechanism is thus carried in a 
different chromosom2 f on the genes for the red-green complex. 

7. Adaptation to strong yellow light (of wavelength about 575 my) 
produces very little change in the appreciation of hue when vision is 
confined to the central fovea, but, with larger fields, a state of ‘violet 
monochromatism” is produced (Brindley, 1953). In this state, all wave- 
lengths between 400 mu and 500 my appear identical and as a very sat- 
urated violet. In this condition the blue mechanism is probably acting in 
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Fic. 4. The proposed sensitivity curves for the three photopic mechanisms of the 
normal eye. 

P =P cones; D = D cones (see Fig. 1). Lie. ; 

B =the mean curve of the two curves marked III in Figs. 2 and 3. This represents 
the probable residual activity of the rods after inhibition by a mixed population of P and 
D cones. é ; 

o——o, Log,) increment threshold curve after adaptation to a high intensity of 
yellow (578 my) (Stiles, 1953) which isolates the blue mechanism (Brindley, 1953). 


great excess over the other mechanisms but, unfortunately, its spectral 
sensitivity cannot easily be measured because, for some unexplained 
reason, the judgement of luminosity is extremely bad under these con- 
ditions. This quality of the blue mechanism seems to be rather character- 
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istic and it will be remembered that luminosity measurements in the blue} 
part of the spectrum are notoriously unreliable, even in the normal eye.| 
The spectral sensitivity, as determined by increment threshold measure-} 
ments under these conditions (i.e. with adapting field of wavelength 5784 
my) is, however, shown in Fig. 4 (Stiles, 1953). The fact that this ““mono~ 
chromatism” produces a sensation of colour is also peculiar, since othe 
forms of monochromatism (e.g. rod vision, central foveal vision in pro-j} 
tanopes and deuteranopes, central foveal vision in the normal subject after 
adaptation to either red or blue-green, or vision by the whole eye afte 
adaptation to strong violet light followed by adaptation to blue-green or} 
to red) only evoke colourless or extremely unsaturated sensations. More-}} 
over, recovery from the violet monochromatism is sudden (Brindley, 195 3)) | 
as camipared with that from other forms of adaptation-monochromatism;}} 
unlike most other forms of adaptation, however, it leaves a very prolongedl 
after-image (Stiles, 1953) though, interestingly enough, adaptation tof) 
bright violet light (i.e. the complementary hue) also produces similarly} 
prolonged after-effects. 

8. After adaptation, either to strong white light or to monochromaticy} 
stimuli, the blue mechanism recovers its sensitivity at quite a differenti] 
rate from the red and green mechanisms, which generally recover inj} 
comparable fashion. 

After adaptation to most wavelengths, but especially to yellow and even, 
to blue, there subsequently occurs the phenomenon of “‘positive blue’’, inj] 
which the blue mechanism appears to be abnormally prominent (Wright, |} 
1946). By comparison with what is seen by the other (unadapted) eye, alli] 
colours are tinged with blue after such adaptation. 

Stiles (1949) found that after adaptation to red light the 7, (blue)if 
mechanism at first gained sensitivity momentarily and then suddenly lostif 
it to such an extent as to be virtually suppressed. 

g. When a disc, whose surface is divided into black and white sectors, isif 
rotated at a certain speed, a flickering image is obtained in which flashes of | 
vivid bluish violet alternate with brilliant yellow. At very much higher} 
speeds, green and pink flashes take their place. Thus, in this experiment, | 
there are again stronger associations between blue and yellow and between] 
green and red than between other combinations, indicating that the time 
relationships of excitation and recovery of the blue and yellow mechanisms } 
are related but differ from those of the red and green mechanisms. | 

10. Under photopic conditions, the relationship between the critical] 
fusion frequency (i.e. the frequency at which a flickering light fuses to give | 
an apparently continuous sensation) and the intensity of the light is} 
different for blue light and for red light. | 

11. While the blue mechanism contributes greatly to saturation and) 
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hue, it contributes very little to luminosity; in this way it again differs from 
the red and green mechanisms. , 

12. The law of additivity of colour mixtures is found to be very nearly 
strictly true for wavelengths from 650 mu to 540 mp, where the blue 
mechanism makes little or no contribution; but, from 540 mp to the blue 
end of the spectrum, there is a progressive deviation, in the sense that the 
field containing the more blue component is always less bright than ex- 
pected (Federov, 1958). This, again indicates that the blue mechanism 
contributes to luminosity in a manner which is different from that of the 
other two mechanisms. 

13. Luminosity discrimination during violet monochromatism, as 
‘mentioned above, is very poor. It is not so noticeably poor in other forms of 
monochromatism, except in that associated with rod vision. In the mono- 
‘chromatic foveal vision of some deuteranopes it is, however, somewhat 
below normal. 

14. The Fechner fractionst of the 7, and 73 mechanisms (i.e. the blue 
mechanisms) are much higher, at 8.7%, than that of either the 7, and 7, 
mechanisms, at 1.9% and 1.8% respectively (Stiles, 1953). It will be 
remembered that the 7, and 7; mechanisms are probably identical with 
the P (green) and D (red) mechanisms of the central fovea which certainly 
involve cones almost, if not quite, exclusively. This value of 8.7°% more 
nearly resembles the figures associated with the rod mechanism, under 
scotopic conditions which are much higher than those associated with 
cone vision in general. 

15. The quantum efficiencies { of the 7, and 7, mechanisms (5 x 107% 
and 1.4 107) are noticeably less than those of the other cone mech- 
anisms (7, and z;) which are 3-8 x 107% and 3.7 X 107% respectively. 
They are also considerably less than that of rods (5 x 10-*) (Barlow, 
1958b). The blue mechanism therefore is, in this way, very different from 
the others and has a considerably lower quantum efficiency than either the 
rods or the cones responsible for the red and green mechanisms. 

16. Measurements of summation areas§ and of visual acuity under 
conditions like that of violet monochromatism, in which the blue mechanism 
is isolated or at least highly dominant (Stiles, 1949; Brindley, 1954), 
suggest that the receptive fields of the cells involved in the blue mech- 
anism are very much larger than those involved in the red and green 


+ The ratio of a just detectable increase in intensity to the intensity of the stimulus 
(A 1/1). ; ances 
t The smallest fraction of the quanta entering the pupil needed to produce a minimal 


effect in the mechanism. 
§ Within the summation area, intensity * area is constant, i.e. the effects of stimulation 


of adjacent receptors are pooled within a certain area, probably the synaptic area of 
bipolar cells or ganglion cells. 
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mechanisms, e.g. they may subtend 7 to 18’ as compared with 1 to 4} 
They are somewhat smaller than those of rods in the dark-adapted state}} 
but there is evidence that rods increase their summation areas during}} 
dark-adaptation (Barlow, 1958a; Barlow, FitzHugh & Kuffler, 1957). ‘Thus 
the size of the receptive field for the blue mechanism is probably compar']} 
able with the “photopic” receptive fields for rods more nearly than it i:ff 
with those for the red or green mechanisms which indubitably depenc}} 
upon cones. 1 | 

17. When the dark-adaptation curve is plotted after the eye has been} 
adapted to orange light, and a violet test light is used, the curve is found taj 
have certain extra cusps on it which Auerbach & Wald (1955) have} 
suggested may indicate the existence of violet receptors. Other explana} 
tions are, however, possible for this phenomenon, since such points coul 
indicate the entry of some neural mechanism; e.g. the threshold of df 
neural pathway perhaps dependent on the activity of several types ojff 
receptor. ; 

18. A uniform visual field, subtending about 4°, but of such low lumin 
ance as to stimulate rods only, normally appears to be colourless, but iff 
may darken and change to blue as soon as an adjacent comparison-field! 
stimulating mainly the cones (i.e. a red, orange or yellow field) is pro 
vided (Willmer, 1949a). A similar phenomenon has been described b 
Weale (1953) when a violet field (A = 460 my), seen in the mesopic rang} 
(i.e. in the range between scotopic (night) conditions and photopic (day ? 
light) conditions), is surrounded by a white field. The violet increased} 
noticeably in saturation (i.e. it becomes more intensely violet). | 

1g. The threshold for a violet test light is raised far more by the juxta 
position of a red light than is the threshold for a similar red light by tha} 
juxtaposition of either red or violet. 

20. The sensations of blue, violet and purple are often associated withifi 
twilight conditions, when rods are beginning to dominate vision (Hunt}f, 
1952; Knowles-Middleton & Mayo, 1952). Night-blind people are some: 
times tritanopic (i.e. lacking in the blue mechanism and having visiori 
similar to that shown by the normal central fovea). Moreover, tritanopia is} 
often associated with retinal abnormalities and particularly with thal 
involving damage to the rods (Helmholtz, 1924; Parsons, 1924). | 

21. Temporary exposure to a bright source of light in a dark roo 
generally brings on the sensations known as after-images; these are aif 
first continuous and appear in the same colour as the stimulus; they ther | 
become discontinuous and in the complementary hue. Finally, a phase of 
intermittent blue and black after-images may be perceived. These inter-f 
mittent blue images are not easily, if at all, evoked from the foveal region 
(Trezona, 1959). 


a 
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22. Some curious anomalies are connected with the chromaticity of 
brown. Browns are, of course, not spectral colours, though a sensation of 
brown can be obtained from spectral orange or yellow when it is noticeably 
dimmer than a comparison field of the same wavelength. On the C.LE. 
colour chart, browns are located, not on the spectral locus, but well 
within the “triangle” (see Wright, 1946) and this means that they involve 
the participation of the blue primary. The use of the blue primary is not, 
however, necessary in order to match orange or yellow, since these are both 
“dichromatic colours” involving only the red and the green primaries. Do 
these observations therefore mean that dimming the orange light auto- 
matically evokes the blue mechanism? This is certainly queer because the 
simple addition of blue to yellow or orange converts these colours to white 
or pink. 

23. In some diabetics, there is a reduction of the red and green visual 
fields, even amounting to their total loss. This has been found to be 
associated with degeneration of the small ganglion cells. The blue fields, 
however, remain intact (Rénne, 1913). This points to a connection be- 
tween the blue mechanism and the large ganglion cells which remain un- 
damaged, and is another example of the difference between the blue mech- 
anism and both the red and the green mechanisms. 


Rods in Photopic Vision 


Such then are some of the peculiarities associated with the blue mech- 
anism. It will have been noticed that many of them (i.e. 2-16, 18, 19, 21, 
22 and 23) indicate that the blue mechanism depends on something 
different in kind from both the red and the green mechanisms. Moreover, 
several of the peculiarities strongly suggest a relationship between the 
slue mechanism and the rod mechanism (see 3, the retinal distribution; 
(3 and 14, brightness discrimination; 16, the receptive field; 18, the effects 
yf an adjacent field on a scotopic white; 20, twilight vision; and 21, the 
ylue after-images). It might be expected that if a “blue” cone were as 
lifferent from the “‘red”’ and the “‘green”’ cones as some of its hypothetical 
yroperties suggest (see 2-8, 10-13, 18 and 19g), it might be possible to 
lifferentiate this type of cone by some histological, cytochemical or 
levelopmental peculiarity or by virtue of some difference in its synaptic 
onnections. Nevertheless, such evidence is entirely lacking. Histologically 
here are possibly two distinct classes of cones (Kolmer, 1936; Leach & 
Villmer, 1950), and there are at least two distinguishable ways in which 
ones connect with the bipolar cells, but beyond that there is no direct or 
tructural evidence for the further subdivision of the cone population. It 
nust, of course, be conceded that a cone could look exactly like the existing 
nes and yet have the peculiarities of function that are associated with the 
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blue mechanism. Meanwhile, however, there are twenty times as manyj} 
rods as cones in the human eye and their function in daylight vision, if any, 
still eludes us. | 
It is very much to the point therefore to consider first how the rods arel| 
likely to behave in photopic conditions, and then to discuss whether they} 
could possibly provide the basis for the blue mechanism. ‘ll 
The observations of Rushton, Campbell, Hagins & Brindley (1955), on | 
the bleaching of the pigments in the living human eye, show quite clearlyjf 
that rhodopsin is present in appreciable amounts in states of light-}} 
adaptation at which the full colour sense is in operation. Rods must there-}} 
fore play some part in photopic vision unless one of the following events 
occurs :— 
(a) The photochemical breakdown of rhodopsin ceases, becomes}} 
different in kind or, in some other way, becomes disconnected froni}} 
the excitation of the synapses. 
(b) The rods are so violently stimulated that they are always discharging Bs 
at their maximum rate and so do not contribute differentially 1 ut 
different parts of the spectrum. While there is evidence that this is} 
true at high intensities and in the spectral regions where the rods ard 
| 


very sensitive (Aguilar & Stiles, 1954), it would not be expected taf 
apply at lower photopic brightness levels nor in those spectigy | 
regions where the sensitivity of the rods rapidly declines or wherd 
the sensitivity of the cones may be very similar to that of the rods} 
However, from experiments on animals, it is worth noting that} 
ganglion cells connected primarily with rods do apparently attain 
maximum rates of discharge more easily than those to which cone 
connect. The saturation of the rods in photopic conditions should 
not, however, make their activity entirely undetectable. 
(c) The activity of the rods is suppressed at photopic levels by a direc 
negative feed-back mechanism set up by the activity in neighbouring} 
rods. It is certainly true that in animals some ganglion cells, activated 
only by rods, can become “‘silent” during illumination and that, 
at high intensities, they function only as pure off-elements, as the} 
result of just each a mechanism. However, once again, ite lowetl} 
brightness levels and the spectral extremes raise difficulties ; for, unden} 
these conditions, the rods would contribute with something of theit} 
own spectral sensitivity and this is inconsistent with the colour] 
mixing data. | 
(d) The rhodopsin mechanism changes at higher brightnesses to be- 
come dependent upon some other photochemical substances of less 
sensitivity and with its maximum activity near 450 mp. Visual 
yellow, for example, would fill this role, but it is not known to bel 
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photosensitive. Conceivably, the re-synthesis of rhodopsin could 
overtake its breakdown and this might. stimulate the rod cell in an 
opposite sense to that achieved by the breakdown. 

As soon as the cones reach a certain threshold of activity they 
begin to suppress the action of the rods by direct inhibition. Several 
reasons can be advanced in favour of this view and the possibilities 
will be more fully discussed in the later parts of this paper. Some of 
the main reasons are as follows :— 

(1) In many animals the reactions of rods and cones are known to 
be opposite in character; e.g. the opposite directions of their photo- 
mechanical movement in catfish, amphibia, etc. (Detwiler, 1943). 
The inhibition of rods by cones is, of course, not inconsistent with 
the existence of a simultaneous but smaller effect of the opposite 
kind, namely the inhibition of cones by rods. The latter effect must 
necessarily be smaller because cone vision certainly occurs in 
photopic conditions over most, if not all, of the visible spectrum. 

(1) Electrical recordings from optic nerve fibres and _ retinal 
ganglion cells show that a greater number of “on-off” units (i.e. 
units responding both when the light is turned on and when it is 
turned off) occur in retinae containing both rods and cones than in 
retinae containing preponderantly either rods or cones. The 
association of pure ‘‘on-elements” with rods and the increase in 
both “on-off” and “‘off”’ units when both rods and cones are present 
suggest inhibition by cones as well as that known to be caused by the 
rods themselves. Some at least of the ‘‘on-off” and “off” units have 
peculiarities in their spectral sensitivity curves which indicate that 
these units depend on two different types of receptor (Granit & 
Tansley, 1948). 

(iii) The effects of passing constant currents directionally across 
the retina on the types of electrical responses obtained from the 
ganglion cells (Gernandt, 1947, 1948) are consistent with the view 
that rods and cones produce opposite electrical effects when stimu- 
lated, or that they are in some way orientated differently or polarized 
in opposite directions. 

(iv) Some monochromats, i.e. some totally colour-blind subjects, 
who show central scotomata (blind spots corresponding to the foveal 
centre) and a probable absense of cones, also have photophobia, 
suggesting that normal photopic conditions are dazzlingly bright to 
them. This would follow if, in the normal person, the scotopic or 
rod mechanism becomes suppressed when the cones become active, 
since, in the monochromat without cones, there would be nothing 


to damp the activity of the rods. Man has, after all, emerged from 
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ancestors that were essentially nocturnal and that presumably }} 
had a highly sensitive scotopic mechanism. This would be too great |}} 
an asset to lose and its temporary suppression in daylight would 
seem to be an obvious compromise. || 

(v) When a visual field, which is so dimly illuminated by light of | 
any wavelength between 400 mp and 580 my as to be stimulating ]} 
rods only, has an adjacent field illuminated with a visibly yellow, |} 
orange or red light (thus stimulating cones), it is no longer seen as a 
colourless field but as a darker and a blue field (see p. 152(18)). The | 
colour developed is independent of both the wavelength used for ]} 
stimulation of the rods (i.e. between 400 my and 580 my) and also |} 
of that (580 mp to 700 my) used for illuminating the cones (Willmer, | 
1949a). [he darkening suggests the action of laterally-operating | 


inhibitory processes initiated by the cones. Incidentally, but of }}} 
particular relevance to a point that will be discussed later, since the |} 
field illuminated by the long-wave part of the spectrum does not |f} 
overlap the area in which the rods are being stimulated, the blueness 
must arise, at least in part, from the stimulation of the rods for these |} 
are the only elements delimiting the size and shape of the field; but | | 
it also depends on effects spreading laterally from cones stavulecd i 
in adjacent areas in the retina. 

(vi) The raising of the violet threshold by an adjacent red light jf} 
(see p. 1§2(19)) is probably a similar phenomenon and could indicate |} 
inhibition by the cones of a mechanism particularly sensitive in the 
blue and violet of the region of the spectrum. 

(vit) Whatever may be the exact mechanism of the inhibition of 
rods by cones it would operate differentially throughout the spectrum, 
more at the red end and less in the blue region. In daylight conditions, |] 
therefore, rods might be so effectively inhibited that they could only ff 
contribute in the blue region of the spectrum where the action of the |] 
cones would be relatively much weaker. Throughout the rest of the 
spectrum the rod pathway might presumably give no response at all, |}} 
though the rods themselves might still be photochemically activated. 
The maximum sensitivity of the blue mechanism at 450 mu and. | 
not at 500 mu (where scotopic rods have their maximum) would ]} 
thus seem to be a probable consequence of their inhibition by cones. ] 

(viii) The inhibition of rods by cones is not inconsistent with the ] 
photochemical effect on the rods becoming at the same time saturated, | 
or maximal, both at high brightness and in those spectral regions in |} 
which the rods are most sensitive. Nor is it inconsistent with the |] 
simultaneous occurrence of self-inhibitory processes as outlined |} 
above, nor with an opposite inhibitory action of rods on cones. All ]| 
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four processes could be in operation at once and they would give to 
the photopic rod mechanism many properties (including spectral 
sensitivity) which would be different from those possessed either by 
the scotopic rod-pathway or by the cones. 

(ix) More direct evidence for the inhibition of rods by cones 
comes from the study by Elenius & Heck (1957) of the changes in 
size of the B wave of the electroretinogram during the process of 
dark-adaptation in comparison with the increase in the visual 
purple content of the eye. In darkness, the latter starts to rise almost 
immediately, while the former shows no change at first and then a 
rather quick rise, which eventually subsides and thereafter coincides 
with the increase in visual purple. The authors interpret the sudden 
rise in the B wave after the initial delay, as being caused by the 
release of the rod activity, which up to that time had been suppressed 
by the cones. Their main reason for this interpretation is that there 
was no delay in the rise in the B wave in a subject who was totally 
colour-blind (i.e. a rod-monochromat), and this subject’s B wave 
followed the regeneration of rhodopsin rather closely. 

Since the electroretinogram, certainly in the frog and probably 
elsewhere also, has its origin in the immediate vicinity of the rod and 
cone cells (Brindley, 1958), the antagonism between the rods and 
cones must occur in this layer, either directly or perhaps through 
the intervention of horizontal cells. The pictured structure of the 
latter in primates (Polyak, 1941) is certainly consistent with their 
establishing connections between cone pedicles and rod terminals. 

By comparison with the action of cells in a comparable position in 
fishes (see p. 144) the horizontal cells could perhaps form the basis 
for a blue-yellow mechanism if, when rod activity was in excess, 
they became electronegative, and if they became electropositive 
when the cones became dominant. 

(x) Study of the electroretinogram under photopic conditions 
shows that, over much of the spectrum, the usual large “‘scotopic”’ 
B wave is very much reduced and is obscured by a quicker diphasic 
response (Adrian, 1945) composed of a negative A wave and a 
positive X wave, of which the latter is particularly associated with 
“red” cones (Armington, 1953). Nevertheless, there are traces of 
something which could be interpreted as a slower B component, 
particularly in the blue-green, blue and violet parts of the spectrum, 
even at quite high photopic levels (Armington, 1953; Johnson & 
Cornsweet, 1954). This component increases with increased light 
intensity slightly more quickly than the positive wave in the red end 
of the spectrum. Moreover, when, under photopic conditions, the 
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intensities needed to produce a given positive response are plotted | 
against wavelength, the electroretinogram is found to be very much. 
more sensitive in the blue region of the spectrum than would be 
anticipated from the standard luminosity curve. This extra sensitivity | 
in the blue is somewhat less pronounced in two congenitally night- ]} 
blind subjects, in parallel with their scotopic responses. All these ]} 
factors are consistent with the view that a contribution from the ]} 
rods is present in the normal photopic electroretinogram, and that } 
it is particularly evident in the blue region. Presumably rods are) 
still active photo-electrically, and particularly so in the region where | 
cones are relatively inactive. 


From all that has been said so far it would appear that there is a prima 
facie case for connecting the blue mechanism with rods. The other side of 
the picture must therefore be examined. 

Three main arguments have been raised against rods participating in 
colour vision as one of the receptor mechanisms. The first lies in the 
observation that the spectral sensitivity curve for rhodopsin is not con- |f} 
sistent with colour mixture data, for it cannot be obtained by any linear 
transformation of the colour mixture curves. However, since the effective 
response which can be made by the rods in photopic conditions may 
depend on (a) inhibition by cones, the extent of which may itself vary in a 
non-linear manner and only start above a certain threshold, (4) inhibition |ff 
by the rods themselves, thus altering, in a non-linear manner, the effec- 
tiveness of any inhibition by the cones, (c) the possible ‘“‘saturation”’ of the | 
activity of the rods over quite a large part of the spectrum so that their | 
response is uniformly maximal (apart from inhibition) over this range, | 
(d) the extent to which each of the two types of cone contributes to any, 
inhibition which they bring about, and (e) the possible inhibitory action of |] 
rods on cones, this argument about the shape of the sensitivity curve may | 
not be so formidable as it appears at first sight. Rods, if they contribute 
to vision in photopic conditions, must clearly have their normal activity |} 
very much modified. H 

‘The second main argument which has been raised against the idea that |] 
rods participate in colour vision is embodied in the dictum that “when two| 
colour-fields match in every respect the receptors in those fields must be. 
stimulated to exactly the same extent in both fields’. Thus, if rods form one |} 
group of those receptors, a match made under photopic conditions should |} 
still hold when the intensity is reduced to scotopic levels and the rods are |} 
then functioning alone. In practice, many trichromatic matches made 
under photopic conditions break down when the brightness is changed and | 
especially under scotopic conditions. For example, the match between 
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two complementary whites, made respectively by mixing red with blue- 
green and blue with yellow, no longer holds when sufficiently dimmed. 

The third main argument against the participation of rods in colour 
vision is the presence of the Stiles-Crawford directional effect in the blue 
region of the spectrum. Light falling obliquely on the cones has been 
found not to stimulate them so efficiently as where it falls in the direction 
of their long axes. This difference is not observed when light falls on rods. 
If therefore the rods are the receptors for the blue end of the spectrum it 
might be thought that the directional effect would not be observed with 
blue light, yet it is actually found to be present. Since, however, it is prob- 
able that for the perception of colour at least two mechanisms must be 
active at the time, and one of these must be cones, the difficulty does not 
seem to be immediately insuperable. The problem will be discussed in 
more detail on a later page (see p. 176) where it will be seen that the ob- 
served data are consistent with the rods being involved in the blue mech- 
anism. 

On the one hand, therefore, there is much circumstantial evidence 
suggesting that rods participate in colour vision and, on the other, there 
are at least three strong arguments against the idea. Either the rather 
formidable array of evidence already cited and the hypotheses to be 
outlined in the later part of this paper must be explained away as pure 
coincidence, or in some other manner, or else a satisfactory explanation 
must be found for such observed facts as the breakdown of photopic 
matches in scotopia, for the presence of a Stiles-Crawford effect in the blue, 
and for the failure to relate the sensitivity of the blue mechanism directly 
to the photosensitivity of rhodopsin. 

While it cannot be pretended that these explanations are immediately 
forthcoming, there are several points which should be kept in mind in 
assessing the significance to be attached to them. 

First it should be emphasized that two of the arguments are based on 
data obtained from colour matches. Such matches are made by the brain 
on the basis of information passed up to it along the optic nerves. As the 
result of various adaptations, summations and inhibitions, which may go 
on in the retina and elsewhere, the information reaching the brain is not 
necessarily the same as that provided by the receptors themselves. For 
example, inhibition exerted by the cones at the red end of the spectrum 
could conceivably be so intense, that over a large part of the spectrum, the 
rods, though physically stimulated, could make no contribution to the 
perceived colour. By using other pathways, e.g. those connected with 
reflexes, rods might still continue to contribute visual information in other 
ways. This would depend upon the exact seat of the inhibitory process, 
but from the colour point of view the inhibition of rods by cones could be 
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in some ways as effective in eliminating rods from contributing to matches } ] 
in the long wave end of the spectrum as a pre-receptoral colour-filter }} 
could be. H || 

Secondly, it must be emphasized that a particular level of response in a} 
retinal nerve fibre leading from the rods could be determined by :— 


1. the actual sensitivity of the rods; 
2. the effects of lateral summation or inhibition by neighbouring rods; }} 


3. the effects of inhibition by cones (probably of two sorts), including }} 
the probability that the performance of the cones is not entirely ] 
independent of that of the rods, i.e. inhibitory and summative | 
processes may be mutual; | 

4. the complete ‘‘take-over” of the pathway at high intensities by the |} 
cones, as has been observed in similar systems in animals (Granit, |) 
1949). This could be restricted to one part of the spectrum. 


Thirdly, the precise meaning to be attached to the observation that two |] 
fields match needs to be very carefully considered before it can be accepted |f/ 
that a match necessarily means equality of stimulation of the receptors. 
In the following cases it clearly does not. 

(i) If two separate white fields on a black ground are made to match in 
brightness it could be assumed, for the sake of argument, that the receptors 
involved by the two fields are being equally stimulated. If one of the fields 
is now surrounded by a white field which is brighter than itself, then the |} 
original fields will no longer match. The white surround might perhaps be |}, 
expected to spread on to the original field and so to increase the brightness |} 
of the latter, but, in fact, it is this field which becomes the darker of the |} 
two; so this effect of stray light can probably be neglected. Indeed, the |] 
field which is surrounded by white may have to have its intensity raised as |} 
much as five times before a match is again achieved (Grindley, 1959). jf) 
Under these conditions, therefore, two fields are now regarded as matching |] 
even though the amount of photochemical change in the receptors of one |] 
field is likely to be some five times greater than that in the other. Pre- | 
sumably the effects of this extra excitation are reduced by some form of |} 
lateral inhibition from receptors stimulated by the light surround. It is | 
true that this white surround alters the conditions for the match but with 
ordinary conditions of colour-matching, with adjacent fields on a black 
ground, it seems to be at least possible that one field might well influence 
the effects that can be observed from the other. For example, if one side 
was illuminated with a colour mixture which involved a small but signifi- 
cant response from one of the three colour mechanisms, then the extra 
“inhibition” spreading from the adjacent field might well eliminate this | 
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small response altogether and the match could then be made on the basis 
of the remaining two mechanisms. 

(11) In somewhat the same category as the observations just made is the 
observation that a yellow field of limited size can, when surrounded by 
blue, be made to match a similar white field not surrounded by blue. Here 
there is some sort of mutual interaction because, not only does the blue 
take the “‘yellowness” out of the original yellow, but the blue which 
changes the yellow to white may not itself be distinguishable from the 
black surround. One wonders how far ‘“‘yellowness” in one field affects 
“blueness” in an adjacent field, and wice versa, in ordinary colour-matching 
procedure. The effects need not necessarily be symmetrical on a quantita- 
tive basis. 

(11) ‘The sensation of white can be aroused in a variety of ways. Scotopic 
white (stimulating rods only) in one eye can be made to match photopic 
white (predominantly affecting cones) in the other eye. Obviously this 
match does not depend upon the receptors being equally stimulated on 
the two fields. 

A small yellow field (subtending less than 30’) falling entirely on the 
central fovea (probably affecting two classes of receptors) can be matched 
with a white field in the periphery (probably affecting three types of 
receptors). Both appear white even though there are probably no func- 
tional blue receptors in the central fovea. It will be remembered that in 
ordinary colour matches yellow can be made to appear white by the 
addition of blue, thus presumably involving the “blue mechanism’. 

In red-green colour-blind subjects any wavelength affecting the cones 
of the central fovea, which in such subjects behaves as if it were mono- 
chromatic, can be matched both with white, as seen by the dichromatic 
periphery, and with a scotopic white as seen by the other eye. 

The perception of white, however, should perhaps be regarded as a 
special case and not entirely relevant to a discussion on colour mechanisms. 

(iv) On the central fovea, a red field (about 100 x threshold) can have 
a blue light of at least 10 x threshold added to it before becoming detect- 
ably different in hue. A similar effect is observed when red is added to 
blue. In terms of photosensitivity, therefore, matches may not always be 
very precise. Under scotopic conditions this amount of blue light might 
prove a very strong stimulus to the rods, and its presence or absence 
would be very easily detectable. 

(v) Matches made with contiguous fields are easier to make, more stable 
and sometimes different from those made with separated fields. ‘This 
difference probably indicates the prevalence of lateral effects (see (i) above) 
and these are almost certainly post-receptoral in origin. The critical fusion 
frequency of a small white field can be altered either upwards or down- 
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wards by the juxtaposition of another constant field which is either darker] 
or lighter respectively than the flickering field (Graham and Granit, 193 1). 
We do not yet know how far these lateral effects spread. They are clearly i 
more influential on small fields than on large ones but some of them may }} 
spread for some distance and the “‘eye”’ has a habit of making fields uniform |} 
if it can. || 
For reasons such as the above, and in view of the arguments developed} 
elsewhere in this paper in favour of the participation of rods in the blue] 
mechanism, it would seem to be desirable to re-examine the concept of'}} 
colour-matching on adjacent fields (i.e. standard colorimetry) rather more] 
critically in terms of what physiological information such matches actually } 
provide about the mechanisms involved. Perhaps some of the so-called | 
matches, which are nearly always made on contiguous fields as the result} 
of considered judgments with unlimited time and opportunity for adapta- |} 
tion and equilibration, should be re-investigated more dynamically, e.g. 
by reversing the fields, by flicker methods, by alternating fields, by using jf} 
white surrounds and so on, in order to try to assess the parts played by|f) 
direct photosensitivity, on the one hand, and by retinal integration and |} 
adaptation on the other. 
The first steps in such an analysis, by more dynamic methods, have #f/ 
been taken by Bongard, Smirnov & Friedrich (1958) and the results are’! 
interesting. They have made matches by the method of replacement, i.e. 
the observer views one field after the other, on the same area of retina, and |) 
the match is regarded as complete when no change is observed on replacing i 
one field by the other. Their experiments show that while trichromatic |] 
matches made in the periphery often break down seriously on changing the 
brightness level, matches made by using four primaries do not do so, 
One can only suppose from this that the matches made with only three: ' 
primaries are not sufficiently precise and that the small unnoticed differ- |} 
ences are magnified by changes of brightness. Moreover, since the more. 
precise matches made with four primaries were found to be independent of | | 
brightness level, including the scotopic, it follows that the scotopic} 
receptors must contribute to the photopic colour sense of the periphery. | 
The rods, on numerical grounds, are likely to play a more prominent part | 
in peripheral vision than they do in foveal vision and, in consequence, the |] 
periphery is likely to be a more sensitive testing ground for observing their | 
effects than is the fovea. Matches made with only three primaries on central 
fields of less than 1.5° were found to be almost stable, and this may simply | 
be that they can be made initially with the accuracy necessary to stand up |} 
to wide intensity changes. 
The eye is a surprisingly precise instrument, but it is also a very versatile | 
instrument and presumably there must be several features in its make-up 
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which mean compromise in performance. Although mathematical equations 
can be accurately applied to certain aspects of retinal performance, it 
should not be presumed that all the variables are sufficiently known for 
their application to be extended indefinitely. The fact that the law of 
additivity shows a progressively increasing error of as much as 14°%, from 
the red towards the blue end of the spectrum (Federov, 1958) should 
perhaps be a sufficient warning. There are many features of vision which 
indicate that the retina always tends to reduce to a minimum the messages 
which it transmits to the brain. It achieves this end largely by adaptive 
and inhibitory processes, and the resultant impulses in the optic nerve 
emanate mostly from those stimuli whose effects cannot be further reduced 
by adaptation or inhibition. Such impulses are seldom the result of the 
direct stimulation of the receptors only. 

It may be concluded therefore that, while the arguments against the 
participation of rods in colour vision are certainly formidable, the evidence 
upon which they rest is not always above suspicion, and in view of the 
numerous indications that a mechanism, as different from the red and 
green mechanisms as is the blue mechanism, could well have its origin in 
the rods, it is of interest to pursue the matter further and to investigate 
what would happen if cones inhibited the rods on the lines which have been 
suggested above. The results of a tentative scheme are set out in the 
following pages. 


An Hypothesis concerning the Action of Rods in Photopic Vision and 
the Nature of the Blue Mechanism 

Thus in spite of certain strong negative arguments which, though 
extremely formidable, need not yet be regarded as prohibitive, it seems to be 
worth while to explore the possibility that the blue mechanism is basically 
the result of the inhibition of the rods by the cones. Especially is this so 
since the recent observations of Hartline and his colleagues on the eye of 
Limulus (Ratliff, Miller & Hartline, 1958) have given a very clear indication 
of the way in which such inhibitory processes may work and of the factors 
which may be involved. In the eye of Limulus they have shown that 
neighbouring ommatidia (the receptor units in the compound eye) 
mutually inhibit each other according to a quite definite pattern which 
depends on the extent to which the ommatidia are each directly stimulated 
and on their distance apart. If 7, and 7, are the frequencies of impulses 
initiated by the two receptors when each is separately stimulated, then the 
final responses R, and R,, when both receptors are stimulated together and 
after mutual inhibition has occurred, are expressed by the equations 


Ry =1, — Ay(r2 — rx ); 
where k, is a constant depending, at least partly, on the distance between 
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the ommatidia and r,, is the threshold response necessary for exerting any | 
inhibition at all, and 


Ry = 15 — Rolie Tay) 


where the connotations are similar. 

Though in the human rods and cones the inhibition may perhaps be | 
more properly regarded as mainly a direct antagonism, if it does not occur 
through the intervention of horizontal or amacrine cells, it, too, is likely ] 
to be an antagonism between the effects produced by the receptors as the 
result of stimulation; the magnitudes of these effects, like the impulse | 
frequencies in Limulus, are likely to vary with the logarithm of the stimulus |} 
itself and with such factors as the relative numbers of the two types of 
receptors and their distance apart. 

Thus, although the inhibition may be brought about by a different 
mechanism from that studied by Hartline and his colleagues in Limulus, 
it is likely to follow very much the same laws. 

As a first approximation, and neglecting the possible reciprocal reduc- 
tion of cone activity by the rods, no very great error is likely to be com- |} 
mitted if the equation B = R — K (C — C,) is applied to the antagonism 
of rods by cones, where B is the resulting response from rods, now to be 
considered as the basis for the “blue mechanism’’, R is the immediate 
response from the rods, C that of the cones and C, the threshold response 
at which the cones begin to inhibit the rods. In other words, the question | 
at issue is whether it is possible, by choosing a suitable value of K, to |) 
derive a spectral sensitivity curve consistent with that postulated for the |] 
blue mechanism by starting with the absorption spectrum of rhodopsin, |}, 
as giving a close approximation to the sensitivity of the rods, and subjecting 
it to an essentially subtractive process based on the spectral sensitivities of | 
the cones, when all the curves are plotted on a logarithmic basis and an |} 
arbitrary threshold is chosen from which to measure the rod and cone |] 
responses (see Figs. 1 to 4). | 

If all curves of sensitivity against wavelength are plotted on a logarithmic | 
basis and an arbitrary base line is drawn joining the limits of the visible |] 
spectrum, then the ordinates measured from this base line will, other things ] 
being equal, give a measure of the response of the units at each wavelength, | 
and by analogy with Hartline’s data these responses can be subtracted. 

The first problem which arises in plotting these curves is to decide which ] 
cones shall cause the inhibition of the rods. 

This difficulty can be to some extent overcome by first examining the 
hypothetical cases for the colour mechanisms of the protanopes and deuter- |] 


anopes where either P cones or D cones occur in isolation or at least very |} 
heavily preponderate. 
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The second problem is to decide exactly how sensitive rods are with 
respect to cones. Flamant & Stiles (1948) have suggested on the basis of 
the directional sensitivity of cones and its absence in rods that, in the 
normal eye, cones begin to show their dominance at wavelengths longer 
than about 610 my. In Figs. 1 to 3 the sensitivity curve for rods has 
therefore been linked with the P and D curves (themselves linked at 574 
my) at 610 mu. The P and D curves are those derived from the spectral 
sensitivity curves of the monochromatic foveal centres of protanopes and 
deuteranopes and probably represent the spectral sensitivity of the two 
types of cone in the normal eye (Willmer, 1955). 

Thirdly, it is necessary to decide where the thresholds shall be placed. 
Since the spectrum is normally visible from approximately 400 mp to 700 
my the line T (Fig. 2) can be taken as a reasonable threshold from which 
to calculate responses, and the line 7, as the threshold for the inhibitory 
effect of the cones. 

In Fig. 2, curves showing the probable effect of P-cone antagonism on 

the response set up in the rod pathway are plotted for various values of 
K between 1 and 2. The sensitivity curve of the 7, mechanism (one of the 
“blue” mechanisms) as determined by Stiles (1953) is plotted for com- 
parison. 

It is clear from this figure that the blue mechanism of the protanope 

derived in this way could, according to the actual value of K, have a 
spectral sensitivity very closely approaching that which is postulated on 
experimental grounds, and the two mechanisms, i.e. the inhibited rods 
(B, mechanism) and the cones (C,, mechanism) would yield the same 
response at about 490 my. This is the observed colourless or neutral point 
for most protanopes. 

Similar treatment of the situation, but substituting the cones of the 
deuteranope for those of the protanope, yields the curves in Fig. 3. Stiles’s 
m3, mechanism (the other ‘‘blue mechanism’’) is plotted for comparison. 
Again the neutral point could well fall within the right range (see curve ITT) 
for the observed neutral point of the deuteranope, 495-510 my, is generally 
more variable than that of the protanope. 

These curves, based on the data for red-green blind subjects, therefore 
suggest, if they can be applied to the normal eye, that the blue mech- 
anism may normally be some resultant of both these processes, and that 
the 7, and 7, mechanisms each result from the interaction of one of the 
two types of cone on the rod mechanism. The average sensitivity of the 
blue mechanism in the normal eye probably lies somewhat between that 
of the protanope and that of the deuteranope. 

Since the two types of cones are probably about equally stimulated at 

= 578 muy (the colourless point in central foveal vision when only P 
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and D cones are involved) the sensitivity of the blue mechanism deter- | 
mined after adaptation to this wavelength may give a fair indication of its | 
spectral sensitivity in the normal eye. Fig. 4 shows Stiles’s (1953) estimate 
for the increment threshold curve under such conditions in comparison | 
with the mean of the curves for the “‘blue mechanism”’ of the protanopes, 
and deuteranopes as theoretically derived above and which result when, 
RSs: 

It must be remembered that in all these problems the assumed logarith- | 
mic relationships between stimulus and response are only approximate 
and that minor modifications of the curves will almost certainly be} 
necessary. Strongly stimulated cones probably produce relatively more 
inhibition, as compared with those near threshold, than the strictly 
logarithmic relationship would predict. Hartline’s figures indicate that the; 
amount of inhibition is closely related to the frequency of impulses and it} 
must be remembered that this only approximates to the logarithm of the 
intensity of the stimulus. There is also, for example, some indication from 


stimulus to the D receptors are greater than those of a corresponding) 
increase on the P receptors. Secondly the rods themselves may have 
reached their maximum rate of response when strongly stimulated inf 
photopic conditions, and it is by no means certain how the antagonism) 
of the cones would then affect the response in the residual “‘blue pathway”. 
‘Thirdly the minimum cone response necessary to produce any inhibition) 
or antagonism at all is uncertain. In Limulus a definite threshold appears} 
to be necessary before inhibition occurs, and, obviously, if inhibition) 
depended in any way on the frequency of impulses in the opposing cell, 
then there could be no inhibition till at least one impulse was propagated. 
In the human retina this clear-cut threshold may not be required if theif 
antagonism is more direct, but nevertheless there probably is a threshold} 
of some sort and, in the example given, this has been arbitrarily fixed by 
the line T;. The arbitrary choice of K to achieve a best fit takes account}, 
of some of these snags but probably not of all. For example, no account! 
is taken of any reciprocal inhibition of cone activity resulting from stimu-} 
lation of the rods. H | 
In constructing these curves a direct and complete antagonism has been, 

assumed. As alternatives, it may be suggested that it is possible that the 

cones exert just sufficient inhibition on the rods to reduce their response toi} 
equality with that of the cones themselves, or, again, that the cones# 
effectively take over the rod pathway as soon as their response exceeds thatif 
of the inhibited rods. Some of the results obtained by Granit (1949) on theif 
impulse frequencies from ganglion cells in the cat suggest the possibility: 
of this latter alternative; also the spectral sensitivity of the positive wavelf 
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in the photopic electroretinogram is consistent with the idea. It is, how- 
ever, more difficult to harmonize the consequences of this take-over with 
other data and the idea will not now be further considered, though it 
would effectively eliminate rods as an independent set of receptors in the 
red part of the spectrum, which is one of the essential conditions if they are 
to be considered as an integral part of the blue mechanism. 

At this stage it may be permissible to digress somewhat and to consider 
how the sensitivity curves obtained in the manner outlined (Fig. 4) 
describe the colour vision of protanopes, deuteranopes and normal people. 
The results are of interest, though they do not help to show that the 
curves are necessarily the right curves. 

Hecht & Shlaer (1936a and b) and Judd (1948) have investigated the 
nature of the colour responses of protanopes and deuteranopes and have 
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Wavelength (mjz) 
Fic. 5. Theoretical saturation of colours for deuteranopes (D) and for protanopes (22) 
D. The difference between the ordinates of the blue mechanism (i.e. inhibited rods) 
and the D mechanism (see Fig. 3). 
P. The differences between the ordinates of the blue mechanism and the P mechanism 
(see Fig. 2). es | 4 
The curves show the limits of wavelengths within which the colours would be expected 
to be indistinguishable on the basis of a detectable intensity difference of 0-2 logy, units. 


shown that subjects in both these classes see the spectrum in terms of 
different saturations of blue or yellow, with the change of hue occurring 
at a ‘neutral point” lying in the part of the spectrum which is blue-green 
to the normal eye. In the foveal centre of a normal subject a similar di- 
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— 


chromatic situation occurs, giving rise to sensations of red and blue-green} 
(Konig, 1894). There, the saturation of the red or blue-green colour see 


at each wavelength can be effectively described by the difference between 


i 


cal 


Log brightness ratio of primary to white 


450 500 550 600 
Wavelength (mye) 


Fic. 6. Observed saturation of colours for deuteranopes (D) and for protanopes (P) (asf 
depicted by Hecht & Shlaer, 1936b). The curves show the limits of wavelength withi 
which the colours are indistinguishable (cf. Fig. 5). 


the ordinates of the P and D curves at that wavelength (Willmer, 1955).if 
With respect to the red-green blind subjects, therefore, the saturation of the 
spectral colours for protanopes and deuteranopes may well be described by 
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420 460 


noes 
500 540 420 460 
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Fic. 7. Hue discrimination curves for protanopes and deuteranopes. | 
Continuous curves. Theoretical curves based on the rate of change of the difference} 
between the ordinates of the B and P curves and the B and D curves respectively, with| 
wavelength. 
o — — 0, hue discrimination curves as observed (data from Wright, 1946). 


the differences between the ordinates of the B and P curves, and the B and |f 
D curves respectively. Such differences are plotted against wavelength in} 
Fig. 5 and corresponding figures for the actual saturation as determined by 
Hecht & Shlaer (1936b) are plotted in Fig. 6. The similarity is obvious, 
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though the “fit” is by no means perfect. Cells affected by rods and cones in 
the manner of the cells in the retinae of fish, described by Svaetichin and 
his colleagues and mentioned earlier (see p. 144), ice. negative-going in 


Blue 
green Yellow 


oa ag ee ee] 
Violet! Indigo !Blue! } Green ! Brown : Red 
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Saturation 


420 460 500 540 580 620 660 700 
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Fic. 8. Diagram to show how the activity of cells (A and B) detecting the difference 
between the responses of the rods and the cones, after the rods have been inhibited 
by the cones, could combine with similar cells detecting the difference between the 
responses of the P and D cones, to give the spectral colours as normally observed. The 
magnitude of the difference, i.e. the ordinates in this figure represent the degree of 
saturation of the various hues at each wavelength. 


Activity of cell A 
Blue-yellow mechanism. 


\ 


a Se 


Vertical bars (B — C), giving saturation of indigo (?), the colour seen, for instance, 
after adaptation to intense yellow. The ordinates are measured from the lower 


curve as a base line. 

Horizontal bars (C — B), giving various saturations of yellow, the colour seen by 
deuteranopes and protanopes at the long-wave end of the spectrum. The ordinates 
are measured from the lower curve as base line. 


Activity of cell B 
Red-green mechanism. 


a 


3) Se 


Crosses (P — D) giving various saturations of blue-green as seen in isolation at the 
foveal centre. 

Squares (D — P) giving various saturations of red. The blue-green response 
occurs in isolation at about 500 my and the yellow mechanism in isolation at about 
574 my; variation in the relative sensitivities of the P and D mechanisms would 
account for the variable position of yellow in the spectrum as determined by 
different people. 


It is possible that the D cone may be more sensitive than the P cone in the extreme 
iolet thus accounting for the purple quality. 


A — combined with B — yields blue 
A — combined with B + yields purple 
A + combined with B — yields orange 
A + combined with B + yields green 
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response to rods and positive-going in response to cones, would seem tal} 
provide a suitable mechanism for such simple colour discrimination. 

Hue discrimination for dichromats, apart from the change-over froma} 
blue to yellow, means the rate of change of saturation with wavelength orf 
the rate of change of the difference between the ordinates of the B and F 
and of the B and D curves. Theoretical and observed hue discrimination 
curves are shown in Fig. 7. Again the similarity is striking. | 
If the normal eye combines both this cone v. rod mechanism witha} 
the P cone v. the D cone mechanism characteristic of the central fovea, 
the effects can be judged by reference to Fig. 8 where different shadings] 
indicate the different sensations and the ordinates represent the satur-} 
ation of the various sensations. It can easily be seen that the seven different 
spectral hues result rather satisfactorily from combinations of the inform 
ation supplied by two differentiating mechanisms. ‘The origin in this wa 
of an apparently tetrachromatic system from what is essentially a tri 
chromatic system is thus an intriguing method of making the Young 
Helmholtz theory more acceptable to the followers of Hering and at the 
same time resuscitating the Ladd-Franklin and Géthlin hypotheses whichif 
have such attraction for those interested in the evolution of the colou 
mechanism. 

The next problem which arises in connection with the blue mechanismif 
is to explain how it contributes to luminosity and why its contribution is so 
small when the ordinary 1° or 2° matching fields are used. 


i! 
if 


central foveal luminosity curve could be derived from the P and D sensi 
tivity curves if the contributions of each were weighted by a factor whic 
was probably connected with the relative numbers of each type of receptor} 


illustrated on p. 6 of that paper. 

By applying this principle to the three-unit system suggested above, i.e.| 
consisting of the residual rod path (blue mechanism), the P path and theif 
D path, it is possible to construct luminosity curves for the central foveal 
(30, 50’ and 2° fields), and for fields situated 40’, 10°, 15° and 25° off-f 
centre. These can then be compared with the luminosity curves actually} 
recorded for these areas (Thomson & Wright, 1947; Weale, 1951, 1953).\) 
The relative numbers of rods and cones for each of these areas can bef 
roughly estimated from the data of Osterberg replotted by Pirenne (1948).\) 
When this is done it is found that the theoretical luminosity curves 
obtained follow the same pattern as those actually observed; that is to say, | 
with increasing distance from the fovea a peak of sensitivity progressively} 
develops at about 460 my and a relative depression occurs at 500 mu. How-# 
ever, the effect of the rods in raising this peak at increasing distances}} 
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Fic. 9. Theoretical and observed luminosity curves at different retinal positions and 
with various sizes of field. The theoretical data are plotted according to the data given in 
the appendix. The results are given as the continuous curves. The open and closed circles 
represent the luminosities actually observed by Thomson & Wright, 1947; Weale, 1951, 
1953; and Willmer, 1955. 

Curve Centre of field Size of field JENUD, C/B 
° / 


I ° 30 BiB (oe) 
2 Cy 50’ 5/6 100/1 
3 oy 2 1/1 43/1 
4 40’ L5< 1/1 31/1 
5 10° 50° 2/1 1/1-8 
6 150 50° 3/1 1/5 
7 25° 50° 4/1 1/4-3 


P/D = Proportion of P cones to D cones in the area, : 
C/B = Proportion of cones to receptive fields for rods in the area (see Appendix). 
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from the fovea is greater than that required to account for the manner in} 
which the peak actually develops on the observed curves; i.e. the inhibited} 
rods would contribute to the luminosity curve in the manner expected but } 
too intensely. In making this comparison the actual numbers of rods and | 
cones, as estimated by Osterberg, were used. It is, however, arguable that 
the unit contributing to luminosity is not the individual rod but rather the: 
cell which relays the effects of the stimulation of the rods, and it is possible] 
that numerous rods impinge upon this cell. Thus the number of receptive 'f} 
fields for inhibited rods should perhaps be compared with the number of } 
receptive fields for the cones and not just the number of rods with the} 
number of cones. Since the acuity of the blue mechanism has been esti- }/ 
mated by Brindley (1954) to be about 7-5’ while that of the red and green | 
mechanisms is about 1’, and the Ricco areas for blue at 13’, and 4’ for red, 
it would seem not unreasonable to suggest that, instead of comparing the 
absolute numbers or rods and cones, 7°5 rods should be regarded, as aif 
first approximation, as equivalent to one cone. In Fig. 9 are plotted the 
observed luminosity curves at different points in the retina together with |} 
the theoretical curves for the same areas. The latter have been constructed 
on the assumption that luminosity, when receptors of different sorts} 
combine, depends more nearly on the numbers of receptive fields than on 
the actual numbers of receptors. Certainly the agreement between the # 
theoretical and the observed curves is again rather striking, and suggests 
that the distribution of the blue mechanism in the retina follows quite 
closely the distribution of the rods, or rather of their receptive fields. 


Discussion 


On the basis of the observations outlined above, there would thus seem 
to be good grounds for believing that the blue mechanism is not un-|f 
connected with the photopic activity of the rods. If rods are inhibited by. 
cones, the two blue mechanisms, which have been demonstrated by Stiles | 
(1953), would seem to fit with expectation since there are certainly two. 
types of cone and each type must begin to exert its inhibitory action on the |} 
rods at an intensity level dependent on its spectral sensitivity. Thus there |] 
are at least three factors that contribute towards the sensitivity of the| | 
blue mechanism. They are:— 


(i) the sensitivity of the rods themselves, 
(i1) the extent of the inhibition by the P cones, 
(ii) the extent of the inhibition by the D cones. 


Moreover, all of these may be independently subject to change as a con- |} 
sequence of adaptation and mutual inhibition. Probably the nearest 
approach to the mean spectral sensitivity of the blue mechanism is that. 
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obtained when the cones are both equally suppressed by adaptation to a 
high intensity of light of some wavelength near 574 mu (Fig. 5). At present 
there is no means of knowing how effective the inhibition by the cones is, 
but the luminosity curves suggest that it is fairly general in all areas of the 
retina. The effects on the electroretinogram indicate that it occurs at a 
neural level near the receptors, and must therefore either be direct, or 
perhaps mediated by horizontal cells since Polyak’s (1941) figures suggest 
that their structure is adapted for just such a purpose. 

There are certain difficulties in the hypothesis outlined above. For 
example, the luminosity curves are derived by linear summation of the 
“responses” from “‘inhibited rods” and from the two types of cones, while 
the rods are inhibited according to Hartline’s rule which involves cal- 
culating the inhibitory effects in terms of the logarithm of the sensitivity. 
Barlow (1958a) has shown that within a receptive field of a ganglion cell the 
effects of two separate stimuli summate directly and not on a logarithmic 
basis. This is the same effect as has been used above in calculating the 
luminosity curves. Thus there are precedents in eyes for both the mech- 
anism of inhibition and for the summation postulated above, but it is not 
easy to see exactly how both can operate simultaneously. 

Similarly it is not easy to forecast what will happen to the blue mech- 
anism in the mesopic range, i.e. in twilight vision when the cones are barely 
stimulated. The extent of the inhibition must obviously decline, but 
whether this affects all rods alike or whether some rods, perhaps those less 
easily affected by cones, are completely released from inhibition before 
others and then contribute as ordinary visual purple receptors, as they 
must all do in complete scotopia, cannot yet be decided. 

To the credit side of the hypothesis it may be mentioned that the 
different rates of adaptation in the three mechanisms seem to provide 
satisfactory explanations for several anomalies of the blue mechanism. 
For example, after adaptation to yellow the sensitivity to blue is, oddly 
enough, extremely strongly depressed, but it rises very rapidly in the first 
few seconds, much more rapidly than the sensitivity to red and green. This 
could be explained on the grounds that the rods of the blue mechanism are 
still under active inhibition by the cones for the first few seconds (see the 
data given by Elenius & Heck, 1957), and some release of this occurs as 
the cones cease to respond to the stimulus. Curiously enough, to the eye 
that has been adapted to yellow light or even to blue light, all colours 
appear to be more blue than they do to the other eye. This presumably 
means that while both cone responses are decreased, the blue response 
though made absolutely much less is changed relatively less. These 
phenomena are also somewhat similar to that found by Stiles (1949) in the 
recovery of the blue mechanism after adaptation to red light. At first the 
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blue mechanism appeared to be relatively little affected by the adaptation, | 
but after a few seconds, as the red and green mechanisms recovered, the 
blue mechanism became even less sensitive than the green. Presumably 
the inhibitory action of the cones then began to assert itself. 
As mentioned earlier, a sensation of brown is evoked from a spectral | 
yellow or orange by merely dimming one half of a divided field of these | 
colours. It certainly becomes easier to understand why the blue mechanism | 
should become involved by dimming the yellow if dimming accentuates ] 
a lateral inhibitory process such as that suggested as necessary for the blue | 
mechanism, which originates in the brighter (yellow) field and extends | 
over the dimmer field. | 
In all the work on increment thresholds, where one field is surrounded 
by another, it is unfortunate that, up to the present, the nature of the 
observed increment is seldom clear. Detectable differences in luminosity, 
saturation and hue can all have different causes but any one of them could 
make the stimulus visible. Moreover, even if the increments are all purely 
luminosity increments, abrupt changes of luminosity could arise when each | 
of the following units reaches threshold: (a) rods, (b) P cones, (c) D cones, | 
(d) P cones inhibiting rods, (e) D cones inhibiting rods, (f) any mechanism 
reaching a “‘saturation” response. Similar considerations also must be 
taken into account in interpreting the “‘knees” on dark-adaptation 
curves (such as those described by Auerbach & Wald, 1955) especially |} 
after adaptation to various conditioning stimuli. The rod pathway may |] 
have its threshold altered by direct adaptation (both photochemical and 
neural) and also by inhibition, and there are four possible sources of that |] 
inhibition, P cones, D cones, rods themselves and some “‘central” source. | 
It may therefore be possible to detect ‘knees’? corresponding to the | 
thresholds for P cones, D cones, rods inhibited by P cones, rods inhibited 
by D cones, rods inhibited by rods, in addition to the absolute threshold | 
for the rods. Moreover these thresholds are all detected, psychophysically, |] 
by the sensory mechanism, and all or any of the “‘knees” found may not | 
correspond to the receptors themselves but be determined by the thresholds | 
of bipolar cells, ganglion cells or even of units in the visual centres them- 
selves. 
The directional sensitivity of the receptors is a phenomenon that 
seems to be principally connected with the cones, and rods as such are | 
thought to be unaffected (Flamant & Stiles, 1948; Donner, 1958). 
Nevertheless the blue mechanism does show directional sensitivity, and 
this, at first sight, weighs heavily against the hypothesis that the blue 
receptors are rods. The situation is, however, more complicated than that 
because, on the above hypothesis, the cones are involved in determining 
the sensitivity of the rods in photopic vision when these are acting as the 
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basis for the blue mechanism. From the published figures for the extent 
of the directional sensitivity effect (Stiles, 1939) it is not easy to determine 
exactly how the directional effect would be expected to affect the blue 
mechanism. In the dark-adapted parafovea there is no significant directional 
effect from the blue end of the spectrum to about 2 — 600 my, from 
which wavelength the effect increases rapidly to the red end of the spectrum 
(see Fig. 10, Curve 1). This is probably correctly interpreted to mean that 
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Fic. 10. The magnitude of the Stiles-Crawford effect at different wavelengths and 
under different conditions. 


I. The dark-adapted parafovea. The rods are thought to determine the result from the 
blue end of the spectrum to about 600 mp and the cones from there to the red end 
of the spectrum. 

II. The dark-adapted fovea. The effect is predominantly on the cones. 

III. The light-adapted parafovea. Cones probably predominate from about 540 my to 
700 mu, while rods and cones together may be determining the effect from 420 mp 
to 540 mp (data from Stiles, 1939). 


the rods do not show directional sensitivity, but that the cones do. In the 
dark-adapted fovea, where cones are probably involved almost exclusively, 
the directional effect is largest in the blue, less in the red and least in the 
yellow (Curve 2). On the other hand, in the light-adapted parafovea 
(unlike the dark-adapted parafovea) there is as large a directional effect 
in the blue (Curve 3), as in the yellow; but it is not so large as in the fovea 
where the “‘blue mechanism”’ is actually far more sparsely represented. 
This appears to be a somewhat anomalous state of affairs if the blue 
mechanism is only dependent on cones. If cones respond less actively to 
oblique rays, then their inhibitory effects on rods are presumably less when 
the light falls obliquely on them and thus the blue mechanism (rods 
inhibited by cones) should become by that amount the more sensitive. 
Thus, where the blue mechanism contributes in any major way to lumin- 
osity, the directional effect should be smaller than if it depended on cones 
only. This is exactly what is found; the directional effect on the light- 
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adapted parafovea, where the ‘“‘blue mechanism” is more in evidence than ] 


in the fovea, is not as great in the blue as it should be if luminosity de- 


pended only on cones, as it probably does more nearly in the fovea. In the | 


fovea the directional effect in the blue is greater than in the parafovea 
though the “blue mechanism” as such is less in evidence. Thus the direc- 
tional sensitivity of the blue mechanism may be more complex than it 
appears at first sight, and it does not, of necessity, rule out the rods as 
being concerned in the perception of blue. 

The change of hue associated with the Stiles-Crawford effect (Stiles, 
1937) is also interesting and requires explanation. The changes are irregular 
and there are probably several causes for them. First, if cones in general 
are less efficiently stimulated by oblique rays, their inhibitory effect on 
rods is presumably less. At the blue end of the spectrum, therefore, the 
blue mechanism gives a greater response than with direct rays so that the 
colours of oblique rays must appear to match those of shorter wavelength. 
In the long wave end of the spectrum the blue mechanism does not enter 
into the system and the effect must therefore be entirely on the cones, and 
in order to account for the apparent shift towards shorter wavelengths, the 


obliquity of the light must affect the D cones more strongly than it does jf) 


the P cones so that the D cones give a relatively smaller response, and the 


difference between the two is smaller. The final result of the directional |} 


effect on the appearance of colours throughout the spectrum must depend 
on (1) the magnitude of the effect of obliquity on the responses of the P 


and D cones respectively at each wavelength, and (2) the effect of this | 


alteration of the cone response on the inhibition of the rods and con- 


sequently on the size of the residual rod response, i.e. on the response of | | 


the blue mechanism. It is, however, difficult to see why the effect at any 


wavelength should be of the opposite sign, as has actually been recorded | | 


for some wavelengths near 520 mp. 


It thus seems to be premature to dismiss the rods from participation in 


photopic vision and from making their contribution to the production of 
the sensations of colour, and it is quite clear that the peculiarities of the 
blue mechanism are in urgent need of explanation. 


Appendix 
The curves in Fig. 9 have been plotted in accordance with the following 
observations and hypotheses. 
1. The ratios of the numbers of the P and D receptors are purely hypo- 
thetical and have been based on three considerations. 
(a) The derivation of the central foveal luminosity curves from the 
luminosity curves of the protanope and the deuteranope indicated 
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the probability of a decreasing number of D receptors with increasing 
distance from the foveal centre (Willmer, 1955). 

(6) The D mechanism is probably the most variable (cf. protanomaly) 
and is usually most concentrated in the foveal area, as indicated by the 
fact that the retinal field sizes for red and green are much smaller 
than those for blue and yellow. 

(c) There is a possibility that the “dark cones” (Leach & Willmer, 
1950) may be the D receptors and these become less numerous with 
distance from the fovea. Differences in these ratios only yield 
relatively small differences in the luminosity curves by comparison 
with the other differences discussed in this figure. 


2. The ratios of P and D receptors to blue receptors are based on 
Osterberg’s estimates (as replotted by Pirenne, 1948) of the numbers of 
rods and cones in one human retina. The estimated numbers of rods have 
been divided by a somewhat arbitrary factor of 7-5. This has been done in 
order to give some weight to the fact that the rods, and the blue mechanism 
too, have much larger receptive fields than the cones concerned with red 
and green. The factor of 7°5 probably errs on the small side, which means 
that the blue mechanism may have been given rather more influence than 
it actually exerts. For any more precise estimate more data are needed as to 
the relative numbers of rods and cones in different retinal areas and corre- 
sponding information on the sizes of receptive fields for the different 
mechanisms. 

The following example shows the method of calculation :— 


Location of field 25° 
Size of field 50" 
Ratio of rods to cones 125/4 
Estimated P/D ratio 4/1 


1. Estimated percentage contributions 


P mechanism 15 
D mechanism 4 
B mechanism (rods divided by 7°5) 81 


2. Stimulus intensity for each mechanism at different wavelengths 
(Antilog. of the ordinates of the B, P and D curves of Fig. 4). 


420 440 460 480 500 520 540 580 620 my 
P 10 14 22 4g al PRO 2520 2027 a2 40 
D 4 5 8 115 41 105 159 191 159 
Ba 1325 5200, 200 88 20 8 3 I Oo 
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3. Weighted contributions to luminosity according to relative numbers | 
(see 1). 


la 15. 2° e323 s Oly 38 46 og) 6 
D C2. -O2-" 0:03. 0 Ot Om are 6 8 6 
B to7 ~ 162° 102 vi 16 | 2 I fo) 


4. Total stimulus (L) 


109 «164 166 “977 34) 4g 54) Gre 
5. Log,, total stimulus (Log L) 


204~ a1 2°22) 188 5s) 1 09 8 4)! 50 memos 


All the other curves have been plotted in an exactly similar manner. 
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1. Introduction 


The enzyme xanthine oxidase (X.O.) controls the catabolism of biological] 
purines, in particular by metabolizing hypoxanthine and xanthine to uriciij 
acid (Bergel, Bray, Haddow & Lewin, 1957; de Renzo, 1956). Besidest] 
these two fundamental compounds, X.O. has been shown to accept asi} 
substrates a large number of heterocyclic bases: purines (Bergmann &il 
Dikstein, 1955, 1956; Bergmann & Kwietny, 1958a, 1959; Bergmann | 
Ungar, 1960; Bergmann ef al., 1958, 1959, 1960a, b; Fridovich &# 
Handler, 1958b; Klenow, 1952; Kwietny et al., 1959; Wyngaarden, 19573] 
Wyngaarden & Dunn, 1957), pteridines (Bergmann & Kwietny, 1958b;)I 
Hofstee, 1949, 1956; Lowry, Bessey & Crawford, 1949), aldehydes} 
(Blair, 1957; Booth, 1935, 1938; Knox, 1946; Mackler, Mahler & Green, | 
1954), etc. | 

In the present paper we shall try to determine the electronic aspects} 
of the enzymic oxidation of purines. A large number of quantitative} 
experimental data both on the pathway and the rate of oxidation of| 
these molecules by X.O. are available, essentially through the work of 
F. Bergmann and his collaborators. | 


2. The Method 


The calculations of the electronic indices of the molecular structure of} 
purines, relevant to the problem of their enzymic oxidation, have beenif 
carried out by the L.C.A.O. approximation of the molecular orbital 
method of quantum chemistry (see Pullman & Pullman, 1952, for details}} 
about the method) with appropriate parameters adopted for the Coulomilf 
and exchange integrals involving heteroatoms (Pullman & Pullman, 1958d,| 
1960a). A part of the data utilized in this paper originates from previously! 


+ This investigation was supported by a U.S. Public Health Service Grant No. CY-3073 
(National Cancer Institute). 
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published calculations on the electronic structure of purines (Pullman & 
Pullman, 1958a, b, 1959, 1960b; Pullman, Pullman & Berthier, 1956). 

One of the major difficulties encountered in this research is related to 
the possibility of the existence of certain of the compounds under investi- 
gation in a number of tautomeric forms, neutral and ionic. All our calcula- 
tions refer to the predominant neutral tautomeric forms of the molecules, 
such as manifested themselves in a series of experimental investigations 


O 
HN : N HN~ SS | N ‘C ae 
| 
| \| 
SS p Bee 
N NH N NH N NH 
I-Hypoxanthine II- 2-Hydroxypurine Til—.8-Hydroxypurine 
(6-OH—P) =O P) 5 —=@)a| =|?) 
5 O 
HN NH ah its Lek 
eZ LE 
O 
2 NH N N NH co Q N NH 
IV — Xanthine V — 6, 8-Dihydroxypurine VI - 2, 8-Dihydroxypurine 
(2,6-OH—P) (6>8—OH—P) (Q23— OF) 
O O 
NX i NH ny Sea JOm 
N N 
2 NH oN N NH aN NH 
VII - 2-Hydroxy-8-azapurine VIII — (6-Hydroxy-8-azapurine) IX - Guanine 


(2-OH-8—AzoP) (6-OH-8—AzaP) (2-NH), 6-OH—-P) 
N NH 
a bi O 
N wa Ne Be N NH 


XI-(2-Amino, 8-hydroxypurine) 


X— (6-Hydroxy, 8-aminopurine) 
(Q=Niips. 8=OH =p) 


(6-OH, 8—NH2—P) 


Brown & Mason, 1957; Mason, 1954, 1957). These forms are presented 
n formulas I—XI. The significance of this state of affairs will be examined 
na number of occasions in the discussion. For the sake of simplicity the 
ompounds will be designated in the course of the discussion by the 
bvious abbreviations indicated under the name of each of them in the 
ormulas [-XI. 

One of the essential theoretical quantities involved in our discussion 
vill be the nucleophilic localization energy (N.L.E.) of the carbon atoms of 
he purine skeleton. This denomination refers to one of the fundamental 
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theoretical indices, utilized in quantum chemistry for the interpretation 
(or prediction) of the chemical reactivity of carbons towards attacks b 
nucleophilic agents (e.g. negative ions). It is a measure of an important} 
part of the activation energy for such attacks and, in a series of related 
molecules, it is considered as the essential variable part of the activatio 
energy for this type of reaction. Under such conditions the relative values 
of this electronic index indicate the relative reactivities of the compounds| 
or positions studied (Pullman & Pullman, 1952, 1955). 


3. Results and Discussion 


Experimentally, the oxidation of purines by X.O. takes place always 
on a carbon atom doubly bound to an adjacent nitrogen. It is generally, 
supposed that the reaction takes place in two successive steps: 


1. The hydration of the HC = N group. 
2. The subsequent dehydrogenation of this group leading to the enol or} 
the keto form of the oxidized purine. 


The apparent complexity of this transformation makes it fairly im-} 
probable for a theoretical investigation to elucidate, at present, its detailed 
mechanism, at the electronic level. However, it was hoped that such an 
investigation might bring into evidence the particularly important elec- 
tronic factors involved, at least inasmuch as the substrates are concerned 
(the enzyme-dependent properties being more difficult to attain). Welf 
shall consider such factors in connection, separately, with the pathwayif 
and the rate of the oxidation. 


(t) The pathway of the oxidation 

In purine and a series of its monosubstituted derivatives, there exist 
several HC = N bonds susceptible a priori to be the site of the oxidation. |f 
As arule, however, the reaction occurs essentially at a preferential position. 
The search for a correlation between this preferential reactivity of a given/ 
HC = N bond and its different electronic characteristics pointed to the’ 
existence of a relation between this reactivity and the value of the nucleo-\f 
philic localization energy of the C atom of such bonds: in the case of the 
availability of two or more HC = N bonds, the oxidized carbon is the one} 
having the smallest nucleophilic localization energy (the greatest ability to be 
attacked by nucleophilic agents such as OH~ ion). This proposition is| 
illustrated in Table 1. It can be seen that there is only one exception to our| 
rule: the 8-OH-P which is oxidized at the C atom which has the highest’ 
N.L.E. However, this exception is not a really important one as the two, 
available carbons have in fact very similar N.L.E.s and as the overall re-| 
activity of this compound is relatively very small (see p. 183). 
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It may be noted that Table 1 does not contain the 2-OH-P. This is 
due to the fact that this molecule reacts essentially in the tautomeric form 
II, as shown by Bergmann and his colleagues in studies with its N,- 
methylated derivative (Kwietny, Levin, Bergmann & Brown, 1959). Con- 
sequently, although it has two unsubstituted secondary carbons, it has 


TABLE 1 


Nucleophilic 
Available localization Formal 


Compound IES aces change Experiment references 
(in 8 units)t 
6-—OH-P 2 2036 + 0172 Bergmann & Dikstein, 1956 
8 2°349 + 0:028 
8-OH-P 2 2°376 + 0:077 Bergmann & Dikstein, 1956 
| 6 2°348 + 0°031 
2-NH,-P 6 2-180 +o0-098 Bergmann, Levin & Kwietny, 1958 
8 2-217 + 0:084 
12 2 2°323 + 0-098 Bergmann & Dikstein, 1956 
6 2°176 + 0'093 
| 8 yap ty 10) ++ O°105 
8—-Aza P 2 2:283 +o-109 Bergmann, Levin & Kwietny, 1959 
6 2°120 ++ O*110 
8—NH,-P 2 2°362 +0066 Bergmann, Kwietny, Levin & 
| 6 2238 + 0:059 Engelberg, 1960 
6-NH,-P 2 2°320 + o102 Bergmann & Kwietny, 19584; 
8 2°246 + 0:072 Wyngaarden & Dunn, 1957 
4-NH, pyrazolo 3 2°582 —o:047 Feigelson, Davidson & Robins, 
(3,4-d)pyrimidine§ 6 2°284 + Ort 17 1957 


+ The hydroxylation takes place at the underlined position. 
| } B being the usual exchange integral of the molecular orbital method whose approxi- 


mate value is 20 kcal/mole. 


_ § An isomer of adenine which is an important antitumor agent (Pullman ef al., 1959). 


only one CH = N group susceptible to an addition reaction. In fact, the 
case of this molecule greatly supports the hypothesis of a hydration as an 
intermediate step in the action of X.O. Thus this compound undergoes 
the hydroxylation at its C, although the N.L.E. of this carbon (= 2:197 B) 
is greater than that of its C, (=: 1:840 8). But a hydration can only occur 
at the N,-C, bond. 

We have also reproduced, in the last column of Table 1, the formal 
7-electronic charges (Pullman & Pullman, 1959b) of the available positions. 
No correlation seems to exist between this index and the pathway of 


9xidation. 
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(a) The rate of the oxidation 


Although the oxidation of purines by X.O. involves a comple 
mechanism, composed at least of two successive steps, hydration andj 
dehydrogenation, and although, consequently, the rate of the oxidationg 
depends on the rates of these two reactions, the available experimental 
data only refer to the overall transformation. In our theoretical investiga 
tion we have tried to analyse the factors that might be responsible for the 
relative rate of each of the partial reactions. However, the most significan 
correlation that we have been able to arrive at concerns the rate of the 
total transformation and relates this quantity, again to the value of the 
N.L.E. of the position attacked in the initial step of the reaction. It 
appears, in fact, that the rate of the oxidation of a purine substrate by X.O 
is the greater, the smaller the N.L.E. of the carbon atom which undergoes thé 
hydroxylation. 

This proposition is illustrated in the data collected in Tables 2, 3 and 4 
The interpretation of these tables requires, however, one preliminary re 
mark. This concerns the problem of the existence of the different purineg 
quoted in these tables in different tautomeric forms, in particular i 
different states of ionization. Broadly speaking, the compounds studied 
here may be divided from that point of view into three groups (which 
correspond to the three tables): 


1. Purine, aminopurines, monohydroxypurines and aminohydroxy. 
purines; these molecules exist at pH 8-3, which is the optimum p 
of the reaction, mainly in their neutral form (Albert & Brown, 1954: 
Brown & Mason, 1957; Mason, 1954, 1957). 


2. The 8-azapurines, whose pK,s are around 5 (Bendich, Giner- 
Sorolla & Fox, 1957; Leese & Timmis, 1958) and which consequently 
exist at the optimum pH in the form of anions, the proton bein 
known to depart from the triazolo ring. 


3. The dihydroxypurines, with pK, < 8 (Mason, 1954) but whose 
anionic form is not known exactly. 


Our calculations have all been performed, for technical reasons, for the 
neutral form of all these molecules. It is obvious, consequently, that the | | 
will be of a different degree of reliability for our present problem wher 
applied to these different classes of purines. Naturally, they are quite 
applicable to the first group. In so far as the relative values of the N.L.E.s} 
are concerned they also most probably reflect the order existing in the 
monoanions of the 8-azapurines, as all these compounds have pK, value 
of the same order of magnitude and lose their proton from the same site 
On the contrary too much significance cannot be given a priori to the 
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results concerning the third group of molecules. Whatever it may be, it 
seems obvious that it is preferable to discuss each of these groups 
_ separately rather than all of them together. 

__ First group—Table 2. The compounds may be divided, broadly speaking, 
, into three classes: very good substrates (hypoxanthine, rate = 70, with 
xanthine = 100), moderate substrates (rate ~ 20), bad substrates (rate 
bx 2). To these three distinct ranges of rates correspond three distinct 


TABLE 2 
Experiment 
P 
Initial N.L.E. 
Calculated Product rate of the Experiment 
Compound form measured % oxidized C references 
\ ; (in B units) 
6-OH-P I Uric acid 70 2036 Bergmann & Dik- 
stein, 1956 
pes Uric acid 20 2:176 Bergmann & Dik- 
stein, 1956 
8-NH,,6-OH-P xX 2,6-OH,8—-NH,-P 18 2:072 Bergmann, Kwietny, 
Levin & Engel- 
berg, 1960 
2-NH,-P 2-NH,,6-OH-P 7/5 27180 Bergmann, Levin & 
Kwietny, 1958 
2-OH-P II 2,8-OH-P 16 2197. Bergmann & Dik- 
stein, 1956 
8-OH-P Ill 2,8-OH-P Ta 2°348 Bergmann & Dik- 
stein, 1956 
8-NH,-P 2,6-OH,8-NH,-P 11°33 2°238 Bergmann, Kwietny, 
Levin & Engel- 
berg, 1960 
2-NH,,8-OH-P XI 2-NH,,6,8—OH-P Tes 2°344 Bergmann, Levin & 
Kwietny, 1958 
6—-NH,-P 2,8-OH,6-NH,-P ~1 2°246 Bergmann, Levin & 
| Kwietny, 1959 
'2,6-NH,-P 2,6-NH,,8-OH-P 0-5 — 2284. + Wyngaarden, 1957 
2-NH,,6-OH-P IX 2-NH,,6,8-OH-P ° 2°392 Bergmann, Levin & 


Kwietny, 1958; 


Wyngaarden,1957 
| + Xanthine served as standard (100). 

ranges of the N.L.E.s. Hypoxanthine is characterized by the smallest value 
of this index (2-036), the moderate substrates by intermediate values, 
comprising between 2072 and 2-197 and the bad substrates by high 
values, superior to 2°2468. Guanine which is not a substrate for X.O. 
has the greatest value of the N.L.E. (2°392 8). The satisfactory aspect of 
this correlation is particularly evident when isomers are being compared. 
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Thus, for example, among the three isomeric aminopurines, the 2-NH,- 
is a moderate substrate, comparable to purine itself, while the 8- -NH, 
isomer and the 6-NH, isomer (adenine) are very poor substrates, a result 
in complete agreement with the values of the corresponding N.L.E. of the 
attacked carbons. The case of the three isomeric monohydroxy-purines 
which from the point of view of the relative rate of oxidation belong eacl 
of them to a different class of compounds (6-OH-P excellent substrate, 
2-OH-P moderate substrate, 8-OH-P bad substrate) is even still more 
striking: the values of the N.L.E.s account entirely for this order. 
Second group—Table 3. The order of the relative rates of oxidation of 


N.L.E.s on the appropriate C atoms. There is one exception, however; 
which is the 2~-OH-8-azaP whose experimental reactivity is much smalles 


TABLE 3 
Experiment 
Initial N.L.E 
Chseund Calculated Product rate of the 
P form measured % oxidized C 
+ (in B units) 
6-OH-8-AzaP VIII 2,6-OH-8-—AzaP 41 2:018 
2-NH,-8-AzaP 2-NH,,6-OH-8-AzaP 21 2°133 
8—AzaP 2,6-OH-8-—AzaP 14 2°120 
2-OH-8-AzaP Vil 2,6-OH-8-—AzaP 6-1 2036 
6—NH,-8—AzaP 2-OH,6-NH,-8-AzaP B)07% 2°286 


For these experiments, see Bergmann, Levin & Kwietny, 1959. 
+ Xanthine served as standard (100). 


than suggested by the value of the N.L.E. of its susceptible carbon. The 
case of this molecule is, however, exceptional from the experimental point 

of view too, as the sample utilized by Bergmann et al. (1959) was obviously | 
hydrated before the action of X.O. started. A possible interference with the 4 
specificity of the enzymic reaction may account then for the low yield o : 
the oxidation. | 

Third group—Table 4. This series includes two excellent substrates, 
xanthine and the 6,8-OH-P (both reacting at 100%) and the 2,8-OH- 
which practically is not a substrate for the reaction at all. 

The case of this last compound is, of course, clear, as it has no HC = 
bond to be hydrated. The low value of the N.L.E. of the 6,8-OH-P i ist 
also in rather good agreement with its high rate of oxidability. On the} 
contrary the results for xanthine are not satisfactory: the high value of the 
N.L.E. of its susceptible carbon is in disagreement with its excellent 
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behaviour as a substrate for the enzyme whose very name is xanthine 
oxidase. Although it is particularly disappointing to observe that among 
}all the compounds which we have investigated, xanthine is the only real 
!exception to our correlations, it must however be remembered that, as 
stated previously, this molecule is ionized at the experimental pH values 


TABLE 4 
Experiment 
ae INGE: 
Initial 
Gamssuntd Calculated Product aie of the 


form measured ey oxidized C 
ie (in B units) 


2,6-OH-P IV Uric acid 100 2°310 
6,8-—OH-P V Uric acid 100 2:066 
2,8—OH-P VI Uric acid o-2 2038 


For these experiments, see Bergmann & Dikstein, 1956. 


and that the structure of the anion is unknown.t Consequently, our calcu- 
lations which apply to the neutral species may be far from corresponding 
to the actual situation. 

The calculations that we have carried out, in connection with the problem studied 
in this paper, included the evaluation of a number of other electronic indices which 
seemed likely to intervene in the mechanism of the transformation involved. This 
included in particular the evaluation of the amount of the loss of resonance energy 
upon the initial hydration, of the gain in resonance energy upon the subsequent 
dehydrogenation, of the overall change in resonance energy upon the oxidation 
and of the electron-donor properties of the intermediate hydrated product. The 
possible significance of the resonance energy charges is, of course, obvious. The 
possible importance of the electron donor properties of the hydrated purines may 
reside in the involvement of an electron transfer in the second step of the reaction 
(X.O. utilizes a flavin (FAD) as a coenzyme (Fridovich & Handler, 1958a), and 
‘metal cations as activators). Following the theory developed by B. & A. Pullman 
(1958c) the electron donor abilities of biochemicals are determined by the energy 
lof their highest filled molecular orbital. 

All these different data are reproduced in Table 5. It can be seen that no general 
‘correlation may be observed between any of these indices and the characteristics 
of the reaction. Some correlations of a limited scope and of perhaps some signi- 
ficance, seem, however, to exist. Thus if the purines which exist at the experimental 
conditions in the neutral or ionized forms are considered separately and if a dis- 
inction is also made between the purines which undergo the oxidation at the 
pyrimidine ring and those which undergo the reaction at the imidazole ring, a 
partial antiparallelism is observed in each group between the rate of the oxida- 


tion and the loss of the resonance energy upon hydration of the initial base. It 
l 

| + Cavalieri (Cavalieri et al., 1954) attributes the first ionization of xanthine to the 
departure of the hydrogen from N;. Ogston (1935) considers the enolization at position 


6 and attributes the ionization to the deprotonation of this OH group. See also Nakajima 


and Pullman (1959). 


7.3. 13 
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may also be remarked that, although in the great majority of cases the first oxida-] 
tion is accompanied by a loss of resonance energy, there is exceptionally a gain of 
resonance energy in the case of the oxidation of the particularly good substrates. . 
Finally, while all the hydrated purines should be relatively good electron donors,}) 
no strict correlation seems to exist between the energy of their highest filled mole 
cular orbital and the ease of the reaction. 


TABLE 5 
Difference of resonance Energy of the 
Ovidued Relative energy between the initial highest molecular 

rae Compound rate and orbital of the | 
% the hydra- the final hydrated product} 
ted product product (p.h.o.0.)f | | 
6-—OH-8-AzaP 41 + 0°568 — O-1I2 + 0448 | 
2-NH,-8-AzaP 20 aie Cag + 0:007 + 0°332 
8—AzaP 14 O77 + 0°057 + 0°381 | 
6-NH,-8-AzaP 3°1 + 0°853 +0421 | 
yeh 6-OH-P 70 + 0°588 — 0°079 + 0-143 | 
Pyrimidine 12 20 + 0°738 + 0:088 + 0°407 

6-OH,8—NH,P 18 +0610 + 0°184 

2-NH,-P 17ES + 0-761 + 0°039 + 0:250 

8-OH-P es + 0°978 10-179 + 0°290 

8-NH,-P 3 + 0°874 + 0127 + 0:043 

2-NH,,8-OH-P I + 0°924 + 0:162 
eaCrars 16 + 0:689 — 0'029 + 0:269 ) 
: — = ~I -+- 0°700 + 0'021 + 0°149 
Imidazole 2,6-NH,-P o's + 0-781 + 0183 ; 
2-NH,,6-OH-P ° + 0881 + 0°034 
t In B units. 


Altogether, it seems difficult to draw any conclusion as to the importance of 
these limited correlations. 


It appears thus that the mechanism of the enzymic oxidation of bio 
logically important and related purines by xanthine oxidase, althoug 
undoubtedly a complex process, may be rather satisfactorily related to the! 
value of the N.L.E. of the carbon atom on which the fixation of oxygeni 
occurs. This state of affairs necessarily suggests that it is the initial attack 
by a nucleophilic agent, most probably the OH~ ion, which represents the 
essential step of the reaction, determining both the place and the rate off 
the oxidation. It is hoped that the study which is under way of other cate- 
gories of molecules which may function as substrates for X.O. will help 
to explain the role of the other factors possibly involved in the reaction. 
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The Estimation of Relative Fitness of Drosophila 
Populations 


P. J. CLARINGBOLD{ AND J. S. F. BARKERt 


(Received 8 November 1960) 


A mathematical model of interspecific competition is defined which 
may be used to describe the relative fitness of Drosophila populations. 
An outline of the numerical methods required to apply the model to 
experimental data is given. This model has been tested in a number of 
experiments, two of which are presented here as illustrative material. 
Log relative fitness is found to be significantly varied by some of the 
experimental treatments but is additive over the variables examined. 


1. Introduction 


Although the concept of relative fitness is basic in the theory of population jf 
genetics, no adequate means of determining the parameter experimen- 
tally is available. As an estimate of absolute fitness Carson (1957) uses | 
biomass (the wet weight of hatching adults) produced per unit time in a 
population in equilibrium with its environment. An estimate of relative 
fitness termed the competitive index was defined by Knight and Robertson 
(1957) as the ratio of the numbers of a stock under test to a standard stock 
after one generation of competition, the initial ratio and environmental | 
conditions being standardized. Wallace (1959) has measured relative fitness 
as the mean number of offspring produced by single pairs of flies in physi- | 
cally independent, i.e. not competing, pairs of populations maintained | 
simultaneously. q 
In classical studies of simply segregating Mendelian populations, e.g. I 
diallelic loci or chromosomal rearrangements, the concept of adaptive value | 
relative to a defined genotype is used to explain the deviations of observed | 
genotype frequencies from those predicted on the basis of equal absolute | 
fitness of the three genotypes (see reviews of Dobzhansky, 1951, 1955, | 
1957). | 
This approach cannot be used in studies of the relative fitness of popu- | 
lations in which it is impossible to define the genotypes of individuals. 
A mathematical model which may be used to describe the relative fitness | 
of two or more such populations is presented below, together with tests of | 
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its adequacy under a variety of experimental conditions. In the work, one 
author (J.S.F.B.) was solely responsible for the experimentation and the 
other largely responsible for statistical methods. 


2. Methods 


The stocks used in the experiments were: 

(1) A vermilion-eyed mutant strain (v) of Drosophila simulans, main- 
tained in the laboratory by mass mating until October 1958, and since, in 
two population cages each averaging approximately 3,500 adults. 

(11) The wild-type Oregon-R-C strain of Drosophila melanogaster, 
maintained in this laboratory since September 1958, in a population cage 
averaging approximately 4,000 to 5,000 adults. 

When individuals of the stocks of these two non-interbreeding species 
are placed together in a closed population, the outcome of the competition 
is invariably the elimination of D. simulans vermilion within a few genera- 
tions. This competition has been studied in population bottles under a 
variety of experimental conditions. The population bottles used were an 
adaptation of those of Reed & Reed (1948), and have been described by 
Barker (1960). When a population was started, only one of the medium 
bottles contained medium, the other being empty. The proportions of the 

‘two species in each population were determined every two weeks, this 
period having been calculated as the generation interval of D. melanogaster 
under these conditions (Barker, unpublished data). Therefore, two weeks 
after initiation, each population was etherized and counted by eye colour, 
‘i.e. either the wild-type eye colour of D. melanogaster or the vermilion of 
D. simulans. After counting, the flies were returned to the central chamber 
‘of the population bottles and the empty medium bottle replaced by one 
‘containing fresh medium. Every two weeks thereafter, the older bottle was 


TABLE I 
Composition of the media used in the population bottles 


Constituent Medium F Medium S, Medium S, 
Agar (g) 8 10 10 
Water (ml) 600 700 700 
Treacle (ml) 80 80 80 
Yeast (g) 150t 30f 30t 
Cornmeal$ (g) 80 80 80 
Nipagen M (g) I 2 4 


+ Compressed live baker’s yeast (Saccharomyces cerevistae)—killed by boiling during 
medium preparation. 

t Inactive dried yeast powder (heat killed Saccharomyces cerevisiae). 

§ The cornmeal used is a proprietary breakfast food. 
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replaced by one containing fresh medium. A medium bottle thus remained } 
in the population bottle for four weeks. 

The amount of medium in each bottle was standardized at 10 ml. The 
medium was seeded with live yeast using 15 g compressed yeast in 30 ml } 
water. The amount of yeast added in this way was in some cases varied as) 
an experimental treatment. The composition of the medium also was) 
varied and, as it had an effect on relative fitness, should be noted (see. 
Table 1). The population bottles were stored in a constant temperature 
room at 25 + 05°C. This room was maintained at a constant light period. 
of 12 hours per day (6 a.m. to 6 p.m.). 


3. Mathematical Model 


| 

Selection between two non-interbreeding competing populations when | 
there is no overlap of generations is discussed by Haldane (1932, p. 172). 
If the number of offspring in the subsequent generation per parent in the 
present generation is ms, for population S (say D. simulans) and 
Ny = (1 — k)ng for population M (say D. melanogaster), then k is the co- 
efficient of selection in favour of population S and Wy,, = 1 — k is the 
relative fitness of population M to S. 

In the experimental populations, the total population number per | 
generation is not determined each generation, but the relative frequencies 
of the two species are determined by a census of the living adults once per 
generation. The census is repeated at intervals of the mean generation 
length. Thus if p),; is the proportion of population M in generation 2, 
where 7 ranges from zero upwards, then: 


Puen Pui = "wp (1) 
Psi+s [Psi = Ms; (2) 
Pui + Ps = 1. (3) 
Let 1, = PauilPsi- (4) | 
Then Was = Myr/Ngy (5) 
= Psi Pui #036) [Pui Psi ners (6) 
Sea (gey 1 /Tis (7) | | 
= (r/o). (8) 
The range of variation of relative fitness so defined is: | 
0 > Wy, > o. (9) | 
An alternative scale of measurement of relative fitness is given by: 
Sus = log Wus, (10) 


with the consequences that equal fitness is defined by fy; = 0, and the 
range of variation is symmetrically disposed about this point. It is clear 
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that fxs is the difference between the Malthusian parameters of the two 
competing species (Fisher, 1930). 

If it is assumed that the interspecific competition is on the same basis 
as the intraspecific competition, then the distribution of the proportions 
of equation (4) may be taken to be the Bernoulli binomial (Aitken, 1947), 
and estimates of relative fitness may be obtained in a standard manner 
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-1.0 
O125+ 
0.25¢ 
-0.5 
0.5} 
F 1ol Foot 
2.0F « 
0.5) 
4.0F 
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1 , 
° I 2 
YEASTING 
Lor MEDIUM O 
- 
40} ey 
14+ 
20} r 
WwW ¢ 1.2} 
MEDIUM |} 
1O} Ne) Le ey 
[ MEDIUM 2 
6 0.8} er 
—= — —L $$ $$$ 
° | 2 
YEASTING 


Fic. 1. Relative fitness and heterogeneity factor estimates for the medium-yeasting 
combinations of Experiment A. The geometric mean heterogeneity factor for Experiment 
 B is indicated by the cross. 


using the method of maximum likelihood (see Fisher, 1925; Rao, 1952). 
Let py; and ps; be the observed proportions of the species, and 7; be the 
total number of flies, in any generation. The log likelihood L of a series of 
observed proportions with this distribution is given in terms of the expected 
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proportions, themselves a function of the fitness parameter (Fisher, 1925). | 
Differentiation of this function with respect to the parameter, and equation 
of the differential to zero, yields the maximum likelihood estimate of 
fitness, thus: 


aL |df = 3, [(Pui/Pui) — (Psilbsi)]. 4: = 9 (11) | 


where Z; = (»tW*) (7) W* + 1), (12) 
and N = Number of generations scored. 


The aptness of this estimate is tested by the chi-squared, Fisher (1948): | 
x 
ee 2 (Pari — Bari)”: /Pui Bsi- (13) 


If there is no evidence of heterogeneity, the standard error of the estimate 
is given by the square root of the reciprocal of the negative of the expecta- 
tion of the second derivative: 


s(f)=— ES (nla Bs)-Z (14) | 


On the other hand, if the chi-squared is significant, the model does not 
hold, and the square root of the ratio, 


pK 1), (15) 
or heterogeneity factor, is used to inflate the standard error of the estimate 
(Claringbold, 1955). If the expectations of equation (13) approach o or 1 
such that n;py,; or n;ps; becomes small, say less than 5, the distribution of 
(13) is not that of y?, and the heterogeneity factor will have a smaller | 
expectation than 1. This will occur if the absolute value of log relative | 


fitness is large, unless the population size is correspondingly large (cf. 
Medium o of Fig. 1). ; 


4. Numerical Analysis 


Comparison of equation (11) with similar equations derived in log dose- | 
quantal response line studies (Claringbold, 1955) reveals that the estima- 
tion problem is equivalent to fitting the logit transform to a series of pro- 
portions. With this analogy generation number is equivalent to log dose 
and the known initial ratio of the competing species is equivalent to a 
knowledge of the median effective dose. Only one parameter requires 
estimation, relative fitness, which is analogous to the slope of a dose- 
response line. 

It is well known that the fitting of quantal dose-response lines is time 
consuming unless some simplifying assumption is made. Usually it is 
assumed that the probit or logit is proportional to an angle. Provided the 
number of observations per generation is constant or otherwise large, this 
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assumption considerably reduces the numerical difficulties. The assump- 
tion is only Ce however, if the expected proportion remains in the 
interval 5 to 95% (cf. Claringbold, Biggers & Emmens, 1953) and therefore 
cannot be spplied to the present problem in which many of the observa- 
tions exceed 95%. In view of the number of sets of data requiring analysis, 
it was decided to programme solution of the estimation equation and 
related equations, on an automatic computer (SILLIAC). 

The log likelihood curve is bell-shaped and thus the derivative (11) 
when plotted against fitness forms a sigmoid curve, the central region of 
which crosses the abscissa with a negative slope. The equation is solved 
by an iterative procedure, i.e. by successive improvement of a series of 
values that converge towards the solution. The initial ratio is chosen so 
that the first generation of competition will yield a useful amount of infor- 
mation. The possibility of starting with very small initial ratios is limited 
by the total population size available with the technique. Therefore sub- 
stitution of the observed proportions in equation (7) via (10) gives the 
first member of the series. Then equation (11) is evaluated for the com- 
plete data. Two possibilities arise, (i) the result is greater than or equal to 
zero, or (11) the result is negative. In the former (latter) possibility a further 
member of the series is obtained by adding (subtracting) a constant to the 
previous value, and the differential (11) again computed. This procedure 
is continued until two successive members of the series yield differentials 
one above or on, and the other below the abscissa. 

By means of the rule of false position (Whittaker and Robinson, 1944), 
which involves a series of either linear interpolations or extrapolations, the 
series of trial values rapidly converges so that the difference between the 
final two members may be set less than some prearranged small constant 
(say 10-%). The usual convergence constant (cf. Fisher & Yates, 1957) is 
10% of the standard error of the estimate. Using this value some hand 
computers have recorded failure of convergence, presumably owing to 
failure of carrying enough significant figures. The constant 10-% used in 
the SILLIAC programme is approximately 1% of the standard error, 
and in no case out of 237 sets of data did this occur; even though the 
initial value was very discrepant in some sets and considerable hetero- 
geneity was present. Both of these occurrences have been suggested as the 
cause of failure to converge (cf. Finney, 1952). 

Magnitude control was achieved in the computing programme by use 
of an interpretive floating point system. Per set of data the time spent in 
the input of information and the output of results was (N = 3) approxi- 
mately 12 sec, while the time per iteration was approximately 1 sec. The 
average number of iterations was approximately 8, which could, of course, 
be reduced if the less stringent convergence criterion had been applied. 
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5. Experimental Evaluation of the Model 


An initial series of experiments was carried out for the purposes 03 
developing technique and testing the model under conditions of different 
initial ratios and population numbers. Although there appeared no syste-f 
matic deviation of the observed proportions from their expectation, con- 
siderable heterogeneity was found. Such heterogeneity can only arise (i) 
if the model does not adequately describe the interaction between the 
species, or (ii) if the relative fitness varies with time due to uncontrollec 
environmental factors. 


TABLE 2 


The design, the factors and their levels of application, 


of the two experiments 


Coded Experimental design 
factor 
levels (Al— 2 X<a2 ane ane ISPS OY IS OY) SE FIRS) 


Factors and levels 


Pres Pre- 
Starting Source of . Yeasting Starting experi- =P 
time D. simulans egies (ml) time mental mental 
medium yeastin = 
(ml) | 
° 2:xil: 59 Cage 4 F ° 6:v:60 F ° 
I Q: xii: 59 Cage 3 So orl 13:v:60 iS} ol 
2 03 


Two experiments were carried out to examine these possibilities (se 
Table 2). Both experiments were of the fully randomized factorial type, 
the replication factor being indicated as the last term in the ‘“‘equation’™§ 
defining the experiment. In each experiment one half of the populations 
were set up on the one day, the other half, one week later. Two separates 
source populations of D. szmulans v, which had been maintained in popula | 
tion cages, were used in Experiment A. In Experiment B the possiba 
influence of the medium and yeasting on which the D. simulans v flies fo 
the initial populations had developed is studied. The results of the two: 
experiments, in terms of the mean values of relative fitness, log relative 


fitness, and heterogeneity factor for each treatment combination are give 
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in Tables 3 and 4. The detailed results of these and other experiments 
will be presented in a later paper. However, it'may be seen that the fitness 
of melanogaster (Oregon-R-C) relative to that of simulans v is quite high, 
‘so that in most of the populations of these experiments the latter species 
is eliminated by the third generation. 


TABLE 3 


Mean values of relative fitness (W), log relative fitness (f ), and geometric 


mean heterogeneity factor (a) for each experimental factor combination in 
Experiment At 


Experimental factors and their coded levels 


W yy v’ 
Starting Source of Medium Yeasting 
time D. simulans 
° ° ° ° 48-60 1°55 1°59 
I 81°65 IQ! 0°30 
2 73°10 1°83 0-06 
I ° ZA [OI 1°43 2°30 
I 19°61 127) 2°25 
2 29°14 1°33 2°49 
2 ° 5°78 0°76 5°30 
I 13°31 165 2 8:60 
2 16°55 360} 2°54 
I ° ° 30°80 1°49 0°44 
I 35°99 1°54 0°99 
Zz 75°07 1°87 O'OI 
I ° 12°72 1°04 4°19 
I 15°20 0°94 0:67 
2 24°51 1°39 331 
2 ° 8-69 0°93 0'97 
I 8-92 0°94 9°41 
7) 12°16 1:08 6:03 
I ° ° ° 10°82 0°97 77, 
I 29°69 1247 O'll 
2 345°45 1°92 0-08 
I ° 5°64 0°75 0°93 
I 6:97 0°84 3°14 
2 710 0°85 18-o1 
2 ° 4°68 0°67 3°97 
I 4°11 0-61 Fis 
eZ 5°68 O'75 21°98 
I ° ° 6:60 082 19‘19 
I 19°57 1°28 0°35 
2 34°72 1°54 0°23 
I ° 7°04 0°84 3°66 
I 5°95 O°77 1°31 
2 8-73 0°87 8-23 
2 ° 3°70 0°56 9°80 
I 5°53 0-72 6°35 
2 6:26 0:80 8-03 


Se SSTSCSC—S =e a 
| + f does not equal log;,)W, as f and W are each means of the estimates for the 
‘replicates in each experimental factor combination. 
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TABLE 4 | 
Mean values of relative fitness (W), log relative fitness (f), and geometric 


mean heterogeneity factor (3') for each experimental factor combination in} 
Experiment Bt 


Experimental factors and 
their coded levels 


W i uv 
Starting Fre- Pre- 
ane experimental experimental 
medium yeasting 

° ° ° 21°53 E32 1°30 

I rie 1°16 3°42 

I ° 66°56 1°63 0°58 

I 43°35 1°59 2°88 

I ° ° 9°55 0°99 3°77 

I 18°70 ig) 1°51 

I ° 8-41 0°90 4°75 

I 10°34 1:00 2°30 


oF does not equal log;,W, as f and W are each means of the estimates for al 
replicates in each experimental factor combination. 


TABLE 5 
Analysis of variance of the data of Experiment A 


Standardized mean squares or product 
Source of 


variation D.F. 
f2 v? fv (f —v)? (f+ v)? 

Time I 24°742T 2°267 — 7°488 41°985+ 12°0333 
Stock I 1°814 0013 — 0'149 2125 1°529 
Medium (2) 

© v. 2 (a) I 56:165t 27:°218f — 39°106 161°s595t 5°171§ 

o+ 2v. 1 (b) I 3804 2'013 — 2-760 11355 0°279 
Yeasting (2) 

Ov. 2 (a) I r1-186f 1°658 — 4°306 21°456§ 4°232§ 

o+2v. 1 (b) I 0031 O'129 0068 0'024 0296 
Interactions (29) 
Medium x 
yeasting (4) 

aaa I 2°732 10°378t = BoGeOr 23°7541 2°466 

aXb I 0052 0173 — 0°095 O'415 0°035 

bxa I 2'639 4'316§ — 3:378 13°711§ 0°199 

b xb I 0°649 1084 0-840 0'053 3°413 
Remainder 25 0°371 0°480 — O-lIs 1081 0-621 
Error 36 1°000 1‘000 — O'514 3028 0'972 

8sr= Sy Try 
O°31I 0°474 — 0514} 
ee <cOLOO le PPi<lo-on, §$P < 0°05. 
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TABLE 6 
Analysis of variance of the data of Experiment B 


Standardized mean squares or product 


Source of variation D.F. 


ie v? fv (f—v)? (f+)? 
Time I 15°242T 0625 — 3:093 22°053{ 9681 
Medium I 1°258 0-609 0877 O'113 3621 
Yeasting I 0045 0-016 — 0'031 Onrag — 0'001 
Interactions 
Time X medium I 5°682t 1°406 — 2:831 12°750t 1°426 
Time x yeasting I 0-379 1'063 — 1-216 4°874 eee} fe) 
Medium x yeasting I 07000 0094 O'O15 0064 O°124 
Time X medium 
Q x yeasting I O'212 0-016 0:062 O'104 0'352 
irror 16 1‘000 1'000 — 0°385 2°770 1°230 
st sSy= fy 
o'812 0800 — 0°385 
jk <= 0'0r- ie 1S =< oyvaiy. 


For every experimental population an estimate of log relative fitness (f), 
ind of log heterogeneity factor (v = log v’, see equation 15), was computed. 
The log transform was applied to the heterogeneity factor since it renders 
the variance of the measure independent of its magnitude (Rao, 1952, 
). 225). An analysis of covariance of these variates was made, see Tables 
; and 6. In both experiments replication provided an independent esti- 
nate of the error variance and covariance. The significance of all inter- 
iction terms was examined against this error, and the non-significant items 
vere pooled into a residual term (in Experiment A) merely to shorten the 
able. 

In both analyses a negative correlation coefficient is found between log 
elative fitness and log heterogeneity factor. Since the difference between 
he two estimated coefficients is not significant, the weighted mean co- 
‘ficient obtained by transformation to z (see Fisher, 1948, p. 203) is 
calculated, 


3 — 2 =.010 4. 0°33, 
% = — 052+ 0°15, (16) 
= — 0°48. 


Sh 


\ correlation coefficient of approximately — 0-50 implies that the ratio 
f the major to the minor axis of the correlation ellipse is 3 : 1, viz. 


( cctncuel lente coal omg emp 0 
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The direction of the major axis is given as the difference between the} 
standardized variates. Further analyses of variance were therefore con-] 
structed on this basis, see Tables 5 and 6, and it is seen that there is less] 
pronounced treatment variation in the direction of the minor axis of thef 
ellipse. The relationship between log relative fitness and log heterogeneity 
factor is maintained over all combinations of the experimental variates 
(see Fig. 1). 

The analysis of variance of log relative fitness values indicates that thef 
parameter is strongly varied by secular changes, and properties of the 
medium in Experiment A. The absence of any significant interactions 
with the one exception at the 5% level in Experiment B, out of a total o: 
33 single degree of freedom tests, indicates that the log relative fitness 
measure is additive over the variables examined. 

The results of Experiment B, in which media and yeasting conditions 
were those coded 1 in Experiment A, were uniformly heterogeneous ( 
geometric mean 0’ = 2:16), that is, approximately the value obtained if 
Experiment A, central point, which indicates a consistency of results. 


6. Discussion 


It has been pointed out above that heterogeneity can only arise eithe 
if the model does not adequately describe the interaction between the 
species, or if the relative fitnesses vary with time due to uncontrolleq§ 
environmental factors. If the former is correct, then obviously this mode} 
cannot validly be used for the estimation of relative fitness. However, idl 
the model does hold but uncontrolled environmental factors give rise td 
the heterogeneity observed, then it should be possible to improve the 
experimental technique so that this heterogeneity is removed. 

The fitness of a particular D. melanogaster population is described by 
the model presented here as a single parameter, fitness relative to that of < 
standard D. simulans v population taken as one. To understand the superio 
relative fitness of melanogaster and to analyse the interaction between thé 
two species, the various components of fitness must be considered, for 
example, (i) fecundity, (ii) hatchability, (iii) rate of development of im4 
mature stages, (iv) the numbers of adults emerging, (v) longevity of thé 
adults. It should be pointed out that various attempts have been made td 
estimate relative fitnesses of genotypes by combining estimates of varioug 
of these components (e.g. Wallace, 1948; Buzzati-Traverso, 1955, 
Dobzhansky & Levene, 1955; Spiess & Schuellein, 1956; and Spiess. 
1958). ‘This technique for the estimation of relative fitness is particularl 
tedious and, although it may be satisfactory in some cases (e.g. Spiess} 
1958), its potential dangers should not be overlooked (Barker, 1958; se¢ 
also Dobzhansky, 1956). The rate of development of immature stages a 
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25°C is slightly faster for melanogaster than for simulans (Moore, 1952; 
-Herskowitz, 1953), while the mean generation interval of melanogaster 
(Oregon-R-C) is two to four days shorter than that of simulans v (Barker, 
unpublished). As the populations were counted at intervals equal to the 
‘Mean generation interval of melanogaster (Oregon-R-C) it may be thought 
that this will give a consistent bias in favour of this species, that is, that the 

relative fitness of melanogaster will increase proportionately with time. 

However, it should be noted that it is relative fitness that is being con- 
sidered, and the generation interval is a component of this parameter. 

The assumption of the Bernoulli binomial distribution in defining the 
‘random aspect of the interspecific competition in effect means that it is 
assumed that intraspecific and interspecific competition are randomized. 
Moore (1952) observed, however, that oviposition sites differed in the two 
species. Although both species found the edges of the medium surface 
‘the most desirable, it was found that semulans would deposit eggs in the 
centre more readily than melanogaster. At pupation also, this tendency to 
localize differentially has been observed in these experiments and by 
Herskowitz (1953). Larvae of melanogaster generally move on to the glass 
walls of the medium bottle or into the polythene central chamber of the 
population bottle in order to pupate, while szmulans larvae tend to pupate 
on the surface of the medium. This fact, in addition to the difference in 
‘mean generation interval, probably accounts for much of the difference in 
fitness between the two species. The pupal survival of szmulans is probably 
less than that of melanogaster because, as the medium becomes soft from 
the larval activity, many of the pupae on the surface are either submerged 
in the medium or displaced so that successful eclosion is made more 
difficult, if not impossible. It might be thought that such differentiation in 
oviposition and pupation sites could well lead to heterogeneity because of 
non-randomness in the competition. However, once the eggs hatch the 
larvae are very active and can range throughout the medium. Further, 
neither the eggs nor the pupae are stages in the life cycle involved in com- 
petition, either intra- or interspecific. 

As in the populations where heterogeneity was significant, there was no 
suggestion of systematic deviations of the observed proportions from the 
expected, and as there is no evidence for any factor that would give rise to 
non-randomness in the interspecific competition, it would seem that the 
model does adequately describe the interaction between the species. It 
must be concluded that the heterogeneity observed is due to the fitness of 
the two species varying with time due to uncontrolled environmental 
conditions. . 

An explanation of the negative correlation between log relative fitness 
and log heterogeneity factor is simple. When relative fitness is high (say 
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> 30), the proportion of melanogaster in the population reaches 80 %, 
or more in one generation. Most of the information on relative fitness is 
thus contained in this first generation and it is consequently difficult tof 
detect any heterogeneity. However, if relative fitness is low, the increase 
in the proportion of melanogaster is much slower, so that considerable: 
information on relative fitness is obtained from all three generations. Any} 
fluctuation in fitness with time will then be more easily detected. There-| 
fore, for best use of this model, experimental conditions, the standard} 
population of simulans, and the initial ratio should be chosen so that all 
number of generations (if possible) give proportions in the range 10 to 90%. 
It has been shown therefore that this model may be used to estimate the 
relative fitness of different D. melanogaster populations. However, as} 
relative fitness is significantly affected by the experimental conditions, any) 
estimates so obtained are valid only for the particular set of conditions. 
used. Nevertheless, unless there are interactions between D. melanogaster 
population and experimental conditions, the ratio of log relative fitness o 
one population to that of another, as determined for a particular set o 
experimental conditions, will hold over any set of conditions. | 


We are indebted to Dorothy Allingham, Gillian Davey and Robin Johnston for | 
their assistance in the maintenance and scoring of the population bottles. This 
study has been supported by a University Research Grant to one of us (J.S.F.B.). 
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Thought-processes and certain typical mental phenomena are schematized 
into exact mathematical definitions, in terms of a theory which, with the 
assumption that learning is a relatively slow process, reduces to two sets 
of equations: “‘neuronic equations”, with fixed coefficients, which deter- 
mine the instantaneous behavior, ‘“mnemonic equations’’, which deter- 
mine the long-term behavior of a “model of the brain” or “thinking 
machine”. A qualitative but rigorous discussion shows that this machine 
exhibits, as a necessary consequence of the theory, many properties that 
are typical of the living brain: including need to “‘sleep’’, ability spon- 
taneously to form new ideas (patterns) which associate old ones, self- 
organization towards more reliable operation, and many others. Future 
works will deal with the quantitative solution of these equations and with 
concrete problems of construction—things that appear reasonably feasible. 
With a transposition of names, this theory could be applied to many sorts 
of social or, more generally, ‘“‘collective”’ problems. 


1. Introduction 


A. LEVELS OF APPROACH 


Attempts at a quantitative understanding and analysis of thought-|f 
processes, with or without the explicit aim of devising machines that 
should reproduce functions typical of the living nervous system, date a 
far back as Ramon Lull’s syllogistic wheels. They have become a recog 
nized and major part of scientific investigation since N. Wiener’s cele 
brated enunciation of the principles of Cybernetics; herein lies indee 
clearly, much more than in specialized studies of circuitry or of informa- 
tion theory, the heart and scope of this new science, which aims at synthe- 
sis as well as analysis. . 1) 
The investigation of the mechanism of thought has been undertakeni 
with a variety of methods, ranging e.g. from the study of systems that 
should mechanize the operations of Aristotelian logic without any require- 
ment of similarity to living structures, to the faithful electronic reproduc 
tion of populations of hundreds or thousands of neurons. We shall benefi 
204 
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from all these discussions in that they permit us to reduce the verbal 
presentation of our own concepts to a bare minimum, since they have 
made abundantly clear with what cautions and restrictions one should 
accept for example the very expression ‘‘mechanical thought”; otherwise 
we shall restrict our treatment exclusively to the presentation of our 
approach to this problem, as we feel that in such a field judgment is passed 
better a posteriori than a priori, on the ground of concrete results—which 
are yet to be borne by any theory, including ours—than of mere opinion. 

The present outline of a theory of thought-processes is the result of 
about three years of discussions with people who have been working with 
the same premises in various fields of neuroanatomy, mathematics and 
theoretical physics; it also reflects, of course, the evolution of our own ideas 
through many discussions with guests and hosts. Our main guiding prin- 
ciple has been the conviction, strengthened by these discussions, that the 
human brain, tremendous in its complexity, yet obeys, if one looks at the 
operation of individual neurons, dynamical laws that are not necessarily 
complicated; and that these laws are such as to engender in large neuronal 
assemblies collective modes of behavior, to which thought-processes are 
correlated. A convenient formulation of these laws appears therefore as 
the primary objective of a research of this nature; it can only be achieved 
by trial and error by the process, familiar in the physical sciences, of 
abstracting what seems relevant into a simplified model of the real thing. 
The present work is one such trial; its novelty is not, of course, in the 
concepts just mentioned, which are as old as physics itself, although they 
have not yet gained general acceptance among neurophysiologists, but in 
the attempt made here to give them a precise and quantitative formulation. 

Constant resort to neuroanatomy and neurophysiology, which is the 
keystone of our approach, appears necessary at two different levels: the 
“elementary level”, which studies the individual neurons and the con- 
nections, or synapses, between neurons; the “integrative level”, which 
studies the structure and function of specially connected assemblies of 
neurons, which may act as a whole and play in the nervous system a réle 
similar to that of specialized organs in the body. The integrative level 
compares with the first as the physics of matter does with that of the 
atom, and is of course as essential to the understanding of the functions 
of a brain or of a thinking machine; we firmly believe, against the opposite 
views which we have heard expressed, that a study at the elementary level 
is as essential to the second as one type of physics is to the other. 

We shall have very little to say here about the integrative approach, in 
which many more investigations are needed before a satisfactory state of 
knowledge is achieved, except that our equations, once the appropriate 


connections among neurons of a given assembly are introduced into them, 
14—2 
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will permit the quantitative study of its collective behavior as a whole. } 
It is our belief that the subsystems of a brain are quite different, in struc-} 
ture and complication, from the standard circuits of electronics, and that } 
there will be a great deal to learn in this respect from neuroanatomy; also, 
that a thinking machine built for some special purpose may well need. 
organs, or subsystems, organized quite differently from those of the: 
animal brain, although the same elementary laws will be valid. Our| 
equations are also intended to provide a useful tool both for theoretical } 
study and for experimentation in this respect. i 

Because of the lack of definite knowledge and of general agreement | 
among specialists on many facts of neuroanatomy and neurophysiology 
on one hand, and of the great wealth of available observational material 
on the other, we think it best to present our views as the direct description, 
reduced to bare essentials, of a model of the brain forgoing the detailed 
analysis of anatomical data from which, in fact, our considerations stem. 


| 
B. THE MODEL | ) 


By “model” or “machine” we mean exclusively a device that can | 
actually be built, and which operates according to mathematical equations 
that are exactly known and numerically solvable to any wanted accuracy. 
Although this necessarily implies drastic schematizations and simplifica- 
tions, it is hoped that the features essential to thought-production are 
retained by the model; successive approximations to reality will require 
improvements in the structure of the machine and in its operational laws, 
but at each step one must know exactly what is being done. Without a 
complete mathematical control of the situation, a machine may perhaps 
think, but one would hardly know why or how. 

Mathematically, our model consists of two sets of equations: the 
“neuronic equations”, which describe the instantaneous operation of the | 
machine; the “mnemonic equations”, which describe the growth of 
memory into it. From these equations it is possible to predict and study 
the “mental’’ phenomena which are typical of such a machine: learning, |] 
forgetting, re-integration, conditioning, analysis of patterns and spon- 
taneous formation of new patterns, self-organization into reliable opera- |] 
tion. An exact mathematical definition is given of each of these pheno- | 
mena; that they do actually take place is shown, qualitatively but | | 
rigorously, from the form of both sets of equations; methods for the | 
quantitative solution of these are in part already available and will be dis- | | 
cussed in a future work. Likewise, although we are actively engaged also 
in the study of the concrete aspects of the question, we shall limit the 
present report to an outline of the mathematical theory. | 


| 
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C. NORMAL PHYSIOLOGICAL MODEL 


Finally, we wish to emphasize that our machine does not purport to 


realize necessarily an anatomical model of the brain, that is, there need be 
“No one-to-one correspondence between the anatomical neuron and the 
basic unit of the machine; we are concerned here only with the des- 
-cription of a physiological model, in which, as a whole, it is irrelevant 
whether the functions of a single neuron are taken up by a single unit or 
_by a group of units in the machine, or vice-versa. Likewise, one could 
reproduce the functions of a circuit containing electronic tubes of various 
descriptions in terms, say, of a model circuit containing only triodes. 
We wish to emphasize also that our model intends to simulate the phy- 
-siology of neurons in their normal condition in the living tissue, and not at 
all the various reactions they exhibit when tortured in the physiologist’s 
laboratory: most of the latter will be as irrelevant to the study of the 
collective behavior of neuronal assemblies, as is the detailed knowledge 
of the radiation spectra of Na and Cl ions to the determination of the 
crystalline structure of the NaCl salt. 
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2. Symbols 


unit step function 

Stieltjes integral or summation 

indices denoting integers (subscripts of superscripts) 
fixed integers 

real numbers (coupling coefficients) 

real numbers (thresholds) 

time variable 

a fixed “‘time quantum” 

piece-wise-constant functions (-o or 1 in any quantal 
interval of time) 

class of all functions w,/(t) 

a constellation of neurons at time t 

number of neurons of c;(t) 

a fixed set of neurons 


= classes of solutions of eqs. (2) relative to E. 
= group of transformations under which eqs. (2) are in- 


variant 
configuration space 


= representative point of system in.” 
= a functional space built over Y 


a frame in F 
patterns presented to, or constructed by, the machine 


208 E. R. CAIANIELLO 


f (a,,3 4) = secular equation with variable A 
rd, = eigenvalue of f (a,,; A) = 0 
Pz = small random variation of a,, 
p, <8A,> = average values of p,, and of corresponding variation of 
Ar | 


3. Neuronic and Mnemonic Equations 
A, GENERAL REMARKS 


1. The present considerations aim at simplicity, rather than at formal 
elegance; many restrictive assumptions are therefore made which could 
easily be relaxed, gaining thereby a greater apparent generality in our 
equations but, in reality, only complication which is better avoided at this J 
early stage. The most evident is the fact that we use throughout sum- 
mations instead of integrations, although Stieltjes integrals would be in 
many cases more appropriate to a faithful description of the anatomical 
situations of interest. 

Instead of considering the actual speed of propagation of the neuronic | 
discharge along dendrites and axones, we neglect the first and lump the 
second together with the synaptic delay into a single time-unit 7; this is | 


a better approximation than it may seem, because in the brain, as is well | 


known, speed in axones is proportional to diameter and, although less 
generally, diameter to length. We schematize this situation by assuming 
that a neuron which receives a pulse (either does not fire or) fires after | 
exactly 7 sec; or, more generally, that ts denotes some conveniently small 
“time quantum’’, of which the neuronic delay times are (not necessarily 
equal) multiples (our neuronic equations (2), although apparently designed 
to describe only the first situation, also cover the second). 

2. We shall base our treatment on two sets of equations: the neuronic 
equations (N.E.) which have constant coefficients and determine the in- | 
stantaneous behavior of the system, and the mnemonic equations (M.E.) 
which account for the semi-permanent or permanent changes in the 
structure of the system caused by its past operation. This is, again, an 
artificial simplification of the actual situation, which is better described by 
retaining only the first set of equations, with coefficients taken as “‘slow” J 
functions of time and past neuronic activity. The approximation thus 
made is analogous to the Born-Oppenheimer approximation of molecular | 
physics, which consists in studying first the motion of the (much faster) 
electrons as if the nuclei were fixed, and then the behavior of the latter. It 
is justified physiologically by the experimental observation that electro- 
shock, or concussion, cancels all memories of things learnt within a pre- 
vious time interval of minutes or more, while memories acquired before 
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that time remain unimpaired: this makes it reasonable to assume that the 
brain takes about that much time to change the dynamical phenomena 
which we consider here to be the carriers of functional, short-range 
memories, into semi-permanent or permanent alterations. That the 
latter actually exist is proved by the fact that they are not suppressed by 
hybernation or artificially provoked cessation of all neural activity. 

We may call this the adiabatic learning hypothesis (A.L.H.): the degree 
of adiabaticity of learning in the brain can be estimated roughly from the 
remark just made, with the conclusion that the engramming of perma- 
nent or semi-permanent memories takes roughly a time of 104 to 105 sec 
or more. The determination of the duration of semi-permanent memories 
in the brain is a task for experimental psychology, and is not discussed 
here. 

The mathematical advantages of uncoupling the actual equations of 
neural activity into two distinct sets by means of the A.L.H. will be 
evident: by considering all constants frozen, the resulting N.E. are solv- 
able notwithstanding their utter non-linearity, and in any case their very 
form leads immediately to many interesting qualitative conclusions, as we 
shall show later. 

3. It is perhaps relevant to emphasize that the equations which we shall 
take as the basis of our treatment do not certainly contain, in themselves, 
any striking novelties. They are about what any neurophysiologist would 
write at once, should he wish to arithmetize, say, the kind of logic that is 
usually associated with neuronic circuits, or to formulate some reasonable 
guess about the growth of memory. 

What we consider to be the essential point in our whole theory is, 
rather, the fact that arithmetization is considered here as the necessary 
first step: once equations are written, then, and only then in our opinion, 
the real groundwork can begin. Furthermore, equations alone mean very 
little to a mathematician; the detailed prescription of the type of informa- 
tion which is wanted from the solutions of a given equation constitutes a 
“problem”’, the formulation and solution of which is, in all cases, the most 
relevant question. We shall therefore be concerned here essentially with 
the formulation of problems which arise from these equations and are 
central to our theory of thought-processes; in so doing, we shall meet 
interesting and novel mathematical situations, the quantitative study of 
which is well under way and will be reported in the future. The qualitative 
discussions of Sections 4 and 5 will suffice for our present purposes. 


B. NEURONIC EQUATIONS 


1. We take as the basic component of the machine—which for con- 
venience we call a “neuron’’, although its functional relation to living 
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neurons need not be 1 : 1—a discriminator with a large number of inputs _ 
(dendrites) and a large number of outputs (branching axones). Signals 
can only travel wnidirectionally, with infinite speed, from the output of a 
neuron to the input of the neurons connected to it; when a signal reaches | 
a neuron it is annihilated, unless enough signals arrive with it to cause the | 
neuron to fire a pulse, after a delay 7, simultaneously in all its outputs. The 
intensity of these pulses may vary with the “‘anatomy”’ of the neuron, i.e. 
number of inputs, outputs, location in the machine, etc.; such pulses may 
be attenuated during propagation, or other phenomena may occur, as is 
discussed later. As a matter of formal convenience, we normalize all pulses 
to unit strength and account for larger or smaller strengths by giving 
suitable values to the coupling coefficients. Finally, a neuron will fire only 
if the total sum of afferent pulses is greater than its threshold. All coupling 
coefficients and thresholds are considered to be constant (adiabatic learn- 
ing approximation). 
We define the function: 


| 
= ei ae s| 
| 


oforx 24 0 


let u,(t) denote a function belonging to the class U of piece-wise-constant 
functions which are either constantly o or constantly 1 in any of the inter- 
vals Jr, (J + 1)r (J integer = 0); we take then as fundamental equations 
for the description of the instantaneous behavior of our machine 
(neuronic equations, N.E.): 


u,(t +7) = 1[2, ayy u(t — rr) — $,] (2) | 


The meaning of the coefficients a{’) and s, is stated below; the anatomy of 
the machine at a given instant is described entirely by their values. 
(Taking pat instead of 7 at I.h.s. of (2) would not change the structure of 
these equations: an obvious re-naming of their coefficients would lead 
back to the form (2).) 

2. S,, usually > 0, is the threshold of the neuron h; the neuron h fires. 
at time ¢ + 7 if its excitation at time t (given by the sum in (2) ) is greater 
than s,,. | 

ay, (k #h) is the coupling coefficient that transfers the pulse originating 
from neuron k to neuron f; it contains the total effect of the first on the 
second, regardless of the number of synapses between k and h and of the. 
intensity with which the stimulus coming from k reaches h along each| 
pathway. When aj, 0, we say that there is a (unidirectional) direct 
channel between neuron k and neuron h, which causes a facilitation 
k +h if a{) > 0, or an inhibition k +h if a® <o. 

The réle of the coefficients ajj) (kh ~k) and al’ is quite different: 


hh 
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ay, (h +k; r integer > 0) is “0 only if it is required that the actual 
mechanism of stimulation be such that the ‘effect of the pulse from k 
may reach f, or last on h some time > 7 after k has ceased firing; this 
would be the case if stimulation were due, say, to some transmitter sub- 
stance released at the synaptic junction, which would be re-absorbed only 
after a time > 7. Such a mechanism would account for latency and be 
related to the well-known dependence of pulse frequency on intensity of 
stimuli. It may not be a bad approximation, in a model, to take a¥) = 0 
for h =k, r > o, except perhaps for input elements. 

The coefficients aj;) express instead the memory that the neuron A 
retains of each of its firings (in the brain, for about 1007 sec). For all we 
know, the characteristic observed shape of the neuronic discharge (as 
well as many other things) may well be only the result of biological 
necessity, and to ask that it be closely reproduced in a thinking machine 
might prove as binding as demanding that moving objects be built with 
legs rather than wheels. We shall want in any case ai) <o for all values 
of ry from r = o until rr becomes greater than the absolute refractory time 
of the neuron; for the latter and higher values of 7 it may be convenient 
to follow different prescriptions, according as one wishes to study the 
actual behavior of the brain on this model, or instead to construct a think- 
ing machine for some special purpose. 

3. As an example (among the many that might be produced) of the 
fact mentioned earlier that our N.E. might be a poor description of the 
anatomy and yet give a faithful description of the physiology of a nervous 
system, we consider here the situation that would arise if, in a nerve, or 
bundle of fibers, the electrotonus due to axones which are carriers of 
pulses should induce firings in other axones of the same nerve which 
originate from neurons that have not fired. 

This possibility was not contemplated when writing the N.E. (2). A 
model which reproduces also this new type of behavior must lead to 
equations such that signals can be either transmitted directly from neuron 
k to neuron h, or induced into the channel k Ah by the firing of some 
neighbouring neurons; the neuron h must not be able to discriminate 
whether the pulse it receives through that channel has a direct or induced 
nature. Taking for simplicity a? = 0 forr > 0 (h #R), we obtain clearly 
the wanted equations by replacing Sa?) u,(t — rr) in (2) with 

Ue 


aly u(t) + > bie e,(t) (3) 
kixtk 


where > means sum over the neurons ki, neighbours of k, the axones of 
ki tk . 
which can act in this way on the channel k — h, and bj;) are some suitable 


coefficients. 
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It is then evident that, renaming the coefficients, one finds again N.EJ 
of type (2). The same can be said for inter-dendritic interference. 


C. MNEMONIC EQUATIONS 


1. There is sufficient evidence to prove that memory in the brain isi 
due both to functional processes and to reversible and irreversible altera- 
tions of its micro-structure. Very little, if anything, is known for certair 
beyond this, so that we are forced to rely upon “plausible’’ hypotheses} 
if we wish to assign the specific laws which determine semi-permanent o1f 
permanent physico-chemical changes. We shall not hesitate to do so fos 
the sake of concreteness; we wish however to emphasize that the quali- 
tative analysis of thought-processes which is the purpose of this work does 
not require precise knowledge of these laws, but only that they share some 
very general features, which may be assumed with much greater reliability 

Here lies a substantial difference between the brain and the thinking 
machine: the latter, which is obviously not restricted by the severe limi 
tations of biological necessity, may have mnemonic devices and la 
much more efficient than those of Nature, while giving rise to thought 
processes (as described by the N.E.) of the same type. We feel also that, ag 
regards memory growth and contrary to the situation that arises in the 
study of the N.E., a thinking machine of this sort might be of greater us 
to neurophysiology than vice versa; observations performed on models 
which can be built with mnemonic laws changeable at will, might help t 
shed light on the quantitative aspects of biological phenomena which ar 
extremely difficult to observe directly. 

Thought-processes in a portion of the cortex may be ascribed either t 
excitation of neurons which would be otherwise mostly at rest, or to in 
hibition of the activity of neurons which would be otherwise unceasingl 
firing. To the first one would associate mnemonic mechanisms whic 
make firing easier with the progress of learning (this we may call 
facilitatory, or positive, type of memory); the opposite with the secon 
(inhibitory, or negative memory). Both types offer interesting possibilities 
for machine construction; since they obey essentially the same kind 0: 
N.E., we refer here throughout only to the first type. 

The so-called “genetic”, or “anatomical”, i.e. permanent inheritecl 
memory, corresponds clearly in our description to the fact that, as we shal 
see, some (actually most) of the coefficients which couple neurons to-f 
gether must be taken initially, and kept throughout, vanishing. Ouil 
mnemonic laws will therefore be chosen so that if a coupling coefficienif 
vanishes initially, it stays forever so, while its modulus may grow to maxi 
mum value from any given initial non-vanishing value. 

In our model, thought-processes will be represented by non-trivi 
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solutions of the N.E.; the machine can also “think” therefore if all co- 
ficients in the N.E. stay forever frozen, i.e. if the machine cannot learn 
or forget, provided these coefficients have convenient values. The present 
framework can thus account, as it should, for a clear distinction between 
“instinctive” and “intelligent” behavior. It is natural to suppose that 
genetic patterns determine the laws according to which cells duplicate, 
branch out and anastomize, rather than the actual ultimate detailed ana- 
tomy of a tissue (thus, a “‘gene’’ carrying the instruction ‘add + 1” 
would suffice to generate all integers from zero, while an infinite number of 
“genes” would be obviously required if each integer should have its dis- 
tinctive “gene’’); then even a few mutations may determine the appearance 
of neural structures quite at variance with previous patterns, from which 
the evolutionary laws can secure the selection of the fittest, that is those 
which possess the most favorable neuronal couplings. Our definition of 
thought comprises thus two types of performance for which we use the 
conventional terminology: “‘instinct”, which is learnt genetically, and 
‘intelligence’ which arises when these couplings can change during the 
life of the individual. 

- 2. The quantities s,, a\/) and al’) (hk) were seen to play quite 
different rdles. When assigning their variation with time, we refer hence- 
forth to a machine rather than to the living brain, for the reasons men- 
tioned before. 

It is apparent from (1) and (2) that the maximum learning capacity of 
-he machine is already reached by assigning suitable variations only to 
a) and a’). Once the mnemonic laws are given for these, changes in- 
duced in the s, appear as the best way of controlling the operation of the 
machine. We shall return on this point in Section 4 and consider here the 
‘, as quantities the values of which do not change because of mnemonic 
‘aws, but, if at all, through some different mechanism. 

The coefficients a!) have already been discussed in B, 2, p. 210; for the 


( 


a of the present discussion we may assume. 


(4) 
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(5) 


vhere a Sf” >0, A” >, and it is imposed that aj (t) be con- 
inuous, with a\%) (0) 4 Aj?. 
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For the sake of concreteness we take (5) as the mnemonic equations§ 
(M.E.) of our machine; we also neglect inhibitory (negative) couplings, . 
to which (5) is immediately extended in an obvious manner. We may} 
suppose here, for simplicity, that only coefficients with r = 0 survive,, 
and that all AS) — A and all a\ = a. We have already emphasized that 
all that we actually need are M.E. that admit solutions having the same: 
qualitative behavior as those of (5); these we proceed to discuss briefly.. 

3. We write a,,(t) for a\) (t). The M.E. (5) describe a situation in whichg 
a,,(t) never becomes smaller than a > 0, nor greater than A. When the} 
latter value is reached, it is retained for ever: the information is engrammed: 
permanently. This is perhaps an oversimplified view of the real situation, 
in the brain; it could be, though, easily modified. 

a,,(t) increases if, and only if, the neuron h, which is connected by a 
direct channel to neuron , fires at time ¢ + ¢ and has received a pulse ati 
time ¢ — rt from the latter. It decreases slowly afterwards (a > f), until the 
same situation repeats. Only if a series of such rises occurs, withoutj 
excessive delays in between, can a,,(t) reach the engramming value A. 

There is ample choice of mechanical devices which can reproduce: 
qualitatively this behavior. If it is desired that the machine exhibit 
behavior typified by (3), coefficients like 5? might be given constant 
values, not subject to mnemonic phenomena. 


4- Qualitative Discussion 


A. OPERATIONAL DEFINITION OF ‘‘THOUGHT’’ 


1. We propose now to show that, as was mentioned in the Introduction 
a machine that works according to the N.E. (2) and the M.E. (5) will 
exhibit phenomena which are typical of a nervous system, provided off 
course the number of its elements is sufficiently large and the initial values 
aj (0) of the couplings among these (‘“‘genetic memory’’) are convenientl 
chosen (e.g. so as to prevent “epilepsy’’: cf. c, 4, p. 221). 

The most obvious features of the N.E. are non-linearity and uni+# 
directionality of pulse transmission; their solutions describe therefore 
in any case states of excitation (or “motions”, or “modes”) that “travel?” 
unidirectionally from neuron to-neuron and interfere nonlinearly when 
ever they meet. This interference is either instantaneous or nearly so, as it 
happens when summation of pulses at the synapsis of a neuron causes its 
firing (as described in the r.h.s. of (2) ); or delayed, as it happens whenif 
pulses, which would otherwise cause the firing of a neuron, cannot do so 
because they reach that neuron when it is still inhibited by a previous 
firing, due to different pulses. 
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2. We define a thought-process, operationally, as a solution of the N.E., 
or, equivalently, as the corresponding ‘“‘motion’” in the machine. It is 
convenient further to qualify this definition, so as to meet obvious 
objections. 

We may disregard as “trivial” and not consider as “thoughts” solutions 
that correspond to (say accidental) firings of neurons at a given time, such 
‘that no other neurons are induced into firing thereby and all activity ceases 
immediately afterwards. Any “thought” implies thus the passing of at 
least one neuronic channel. 

3. For any given set £ of neurons, all motions of a given duration can 
be classed either into a set M,, the motions of which cause at least one 


neuron in £ to fire at least once, or into a set M;, of the remaining possible 
motions. There is thus (and in many ways) the possibility of establishing 
operational distinctions between “types of thought’’; should, for instance, 
a portion of the machine correspond to the central and one to the auto- 
momic nervous system, the name “thought” could be further restricted 
thereby to the solutions of the N.E. which affect only the neurons of the 
first. If, in a different partition, E is the set of neurons the firing of which 
is associated somehow with consciousness (e.g. because they control a 
loudspeaker, or some prescribed feed-back mechanism), then all motions 
of M,, can be taken as representing the “‘conscious activity’, all those of 
M,, the “subconscious activity” of the brain. 

It is interesting to remark that, in the latter instance, because of the 
various possibilities of interference discussed before between the motions 
of M,, and those of M,, each type of activity influences the other. 
“Psychoanalysis” reduces for this machine to a simple and well-defined 
mathematical problem. 


. B. PROBLEMS CONNECTED WITH THE N.E. 

' 1. The N.E. clearly contain, as special cases, the description of all 
ogical networks of the kind beautifully analyzed in the pioneering work 
bf McCulloch and Pitts. Should their solution be attempted by the obvious 
method of iteration, they would, for these cases, give just as much—or as 
ittle—information as can be gathered from the standard logical switch- 
yoard analysis; there is here a clear analogy with the Darboux (better 
than the Cauchy) problem of the theory of differential equations. 

| The systematic algebraization of logic, which is the real content of the 
N.E. (with frozen coefficients), permits us to pose for them much more 
yeneral questions, which may be treated with a variety of mathematical 
ools; the logic of the system is seen to play a réle so to speak similar to 
hat of the constraints which limit position and mobility of a dynamical 
ystem; an appropriate treatment of the N.E. will permit, as with the 
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equations of motion of dynamical systems, the search for those long-range | 
collective solutions which, in our scheme, form the basis for a useful 
analysis of thought-processes. 

2. We consider first of all the N.E. with frozen coefficients, in keeping | 
with the A.L.H. Their quantitative discussion poses some interesting and 
novel mathematical questions, and will probably require the introduction 
of techniques ad hoc; on the other hand, it is evident that in simple cases, 
such as may correspond to situations involving very few neurons, the 
N.E. may be solved on inspection. It is also clear that straightforward 
combinatorics can give useful information on the possible types and 
multiplicities of the solutions of interest, as defined below; and that this 
can be translated at once into the customary language of “excitation 
probabilities”, etc. While deferring to future reports for detailed studies J 
on these matters, on which work is in progress, it is fully sufficient for our 
present purposes to formulate the ‘“‘problems” which we envisage as most 
relevant in study of the N.E., and to discuss them briefly at a qualitative 
level. 

The first obvious, and obviously important, remark is that the N.E.\f 
are not uniquely determined; their formulation (2) is perhaps deceptively § 
simple. Because of the definition of the function 1(x), there is a whole# 
group G of transformations which change a given set of N.E. into an 
equivalent one—having, that is, exactly the same solutions, although not 
necessarily the same form (thus, 1(«) = 1(2x) = 1(x’) = 1(sin x), etc.). 
This fact was already used in the discussion of the threshold values s, 
made in Section 3, C, 2; it shows, for instance, that matrix algebra should 
be used with caution in handling these questions. 

For the same reason, “suitably small” changes of the a”) and s, will 
not change the solutions of a set of N.E.: this adds credit to the reliability 
of the A.L.H. and provides what we may call the first criterion of stability 
of the machine. 

3. We shall soon specify what types of “input” and “output” seem. 
most appropriate for a machine of this sort; we are now interested in theif 
“spontaneous” activity of the machine, which we define as that whichiff 
takes place in it when, at a given time ¢), the machine starts from any 
given state of excitation and no input pulses are fed into it for t > ty. 

In a linear network—it is convenient, for purposes of comparison, to 
refer to a system of harmonic oscillators with linear couplings—such 
activity is naturally analyzed in terms of eigensolutions, eigenfrequencies,, 
harmonics; the behavior of a single element is in general not periodic, 
but simple periodal analysis will resolve it into a sum of periodic normal 
modes, which have a collective character and may be defined as the 
motions of quasi-particles (this remark already suffices to eliminate a 
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illusory any attempt at deciding on the existence of periodic motions in 
the brain through observations performed on one or few neurons). In a 

non-linear system things become much more involved; e.g. one finds in 

general, besides harmonics (multiples of a fundamental frequency), also 
_subharmonics (multiples of a fundamental period). 

The extreme schematization which is expressed by the form (2) given 

here to the N.E. has the evident consequence that one can only expect sub- 
harmonics; if there are periodic solutions of the N.E., these are reverbera- 
tions, 1.e. transfers of excitation from neuron to neuron which may reach 
/anywhere into the machine and, after the closing of suitable multi- 
' channel paths, repeat with a periodicity which is, obviously, some integral 
_ multiple of r. 
The consideration of reverberations is central to our approach. There 
' are tremendous numbers of them even in the simplest conceivable models; 
_their types, paths, multiplicities are determined by the coefficients a’) 
and s, of the N.E., and change therefore, because of the M.E., with 
learning and forgetting. 

4. The first mathematical problem is therefore the determination of all 
the solutions that correspond to reverberations, or free (“‘spontaneous”’, 
“autogenic”’) modes compatible with the N.E. The minimum duration of 
a reverberation is clearly determined by the refractory period of the 
neurons through which it travels; if we assume that “‘normal’’ activity 
(i.e. without special stimulation) of the neurons in the brain uses the total 
‘period of the pattern of spike-afterpotentials (~ 100 msec), then the 
}maximum possible frequency (reverberations involving ~ 100 neurons) 
is about 10 cycles/sec, which coincides with the frequency of the a-waves 
of the E.E.G. If we assume further that stimulation may force the neurons 
of the brain into using a refractory time intermediate between the absolute 
+(~4 msec) and the total time, then this maximum frequency increases 
jand the number of neurons necessary for the smallest permissible rever- 
|berations decreases. We do not wish to draw any conclusions at this early 


| If thresholds and couplings have the values that are observed in the 
brain, then reverberations certainly involve several tens of neurons. 
/ Reverberations, furthermore, should last for ever in an ideal machine, a 
conveniently long time in a real machine and in the brain. From the first 
Hremark it follows that one cannot expect to observe direct evidence of 
‘prolonged autogenic activity in a portion of the cortex in ordinary con- 
iditions: this would require innumerable microelectrodes stuck into as 
{many neurons for an experiment to be feasible. One would expect, how- 
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ever, from our theory, that if thresholds are sufficiently lowered artificially | 
or the intensity of stimuli increased, then also a very small number of} 
aptly chosen neurons should suffice for a prolonged autogenic rever-. 
beration to take place. In a brilliant series of experiments A. Féssard J 
(Symposium on Memory, Naples, 1960) has demonstrated, by using; 
tetanic potentiation, that this actually happens: he recorded reverberations 
among only four neurons which would last minutes. We regard his results| 
as a crucial, if only partial, confirmation of our theory, which was developed | 
while we were still unaware of his work. 

5. Reverberations, as all other motions, interfere non-linearly unless 
one reverberation never affects in any way the neurons of another, i.e. asi 
we shall say, is disjoint from the other. At this point the analogy with ag 
linear network breaks down completely, much to the advantage of our 
machine, which possesses many more essentially distinct modes of be 
havior than a linear system. It is still possible to classify all possible spon 
taneous reverberations, for instance according to periodicity, multiplicit 
(i.e. degeneracy), etc. 

The next mathematical problem that arises is the study of the evolutio 
of the state of excitation which was present in the machine at time 
t = to, as was discussed in 3, above. It may either coincide with a con-§ 
figuration of excitations which characterizes at f) a reverberation, and 
thereafter continues its periodic behavior; or, more often, decay, into a 
reverberation, or develop into a reverberation, or produce catastrophic 
behavior, i.e. lead to total (or nearly total) simultaneous excitation 
(“‘epilepsy’”’) of the neurons, which may decay immediately afterwards intd 
resti(ch the IN.E:); 

Excluding for the time being the last dramatic alternative, we find here 
the most interesting situation, as close an analogue as is possible with < 
non-linear system to harmonic analysis. In a frozen state of knowledge} 
out of the machine as many distinct responses can be evoked as there arg 
distinct excitable reverberations, or modes; each of these we may identify 
with a “pattern”? which the machine knows genetically, or has learnt; the 
“initial configuration”’ is the pattern which is presented to the machine| 
the set of (one or more) disjoint reverberations to which the latter gives} 
rise (depending upon the value of the couplings) is the analysis of tha 
pattern performed by the machine which corresponds to the state of know 
ledge it has learnt until that moment. 

Apart from learning, we have here the counterpart to what we regard 
as the essential activity of the mind, the ability to analyze a situation, on 
shape, or pattern, into a set of already classified patterns. No single ele: 


ment acts as a classifier since the total response of the machine is required 
for this analysis. 
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6. The situation described above is manifestly an extreme simplifica- 
tion. The next mathematical problem is in fact that of studying the evolu- 
tion in time of the total state of excitation of the machine when its 
“input” is subject to continued external stimulations. 

It will also be expedient, of course, whenever dealing with very large 

assemblies of neurons, to distinguish between “‘traveling’”’ and “station- 
ary’’ solutions. We have been considering thus far only the latter, but it is 
clear that, as soon as distinctive special-purpose “organs” are built into 
the anatomical structure of the machine, our previous considerations 
should be restricted mostly to the latter, with pulses travelling from organ 
to organ as among the boxes of a diagram. 
- It is also to be expected that there will be a maximum duration, and a 
maximum complexity, beyond which reverberations cease to be sig- 
nificant for pattern-analysis. This assumption, or requirement, will 
greatly facilitate the mathematical study of the problem formulated in 
this section. 

7. We have thus far taken, for the sake of simplicity, a perhaps too 
realistic view of reverberations as modes which are actually connected to 
fixed chains of neurons. This is not certainly the case when one considers 
the normal modes of linear networks, and it is therefore of interest also to 
investigate the possibility of resolving actual motions, which do not have 
manifest periodicity, into ‘‘normal’’ periodic collective modes (cf. 3, 
above: the Lissajous figures of linear problems are an example of this 
dehavior); any such latent periodicity would be easily revealed by obser- 
vations made upon populations of neurons (e.g. with the E.E.G.). 

' Questions of this nature, and many others, suggest themselves in a 
quantitative investigation; they need not be considered here in further 
detail. 

C. ROLE OF THE M.E. 

| 1. In the preceding section we have focused our attention on the 
»peration of the machine when all coupling constants and thresholds are 
sept fixed, and have found that reverberations play a central réle in its 
most typical activity, which is pattern-analysis in a very general sense. All 
such statements presuppose already, of course, the existence of favorable 
sonditions, as are expressed for instance by the assumption (A, 1), which 
srevent epileptic, or catastrophic, behavior; it was also implicitly assumed 
that the machine is indifferent to the ‘‘meaning”’ (referred to any standards) 
»f what it knows genetically or has learnt during its past activity. While 
we can reasonably expect that careful engineering and a long series of pains- 
aking adjustments would in the end produce devices capable of some 


iseful performance solely by virtue of conveniently chosen N.E., we are 


I 
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much more interested in machines that can adjust themselves to pre-} 
scribed tasks by means of some learning mechanism; this should also give 
the machine a tendency to organize itself into increasingly reliable opera- 
tion, so as to compensate for minor flaws in the accuracy of its elements | 

The M.E. provide, to a large extent, the answer to these questions, as} 
we shall now show. In the course of the same discussion it will becomef 
apparent, however, that a machine of this sort is not realistically con- 
ceivable unless at least two additional controlling devices are not alsq 
explicitly included; the first we identify tentatively with the thalamus, the 
second, with more assurance, with the reticular system of the brain. The 
necessity of devices of this sort, if not already suggested in the brain by 
anatomical and physiological evidence, is made imperative in the machine 
by the structure of the N.E. and M.E. 

Mentioning a “thalamus” takes us, of course, one step nearer to the 
“sentient”? machine than we wish to stay for the time being; we shal] 
therefore restrict this part of our discussion to barest essentials, pola 
only to what is relevant for purely “rational” thought. 

2. There are many mathematical ways of representing the overall situa 
tion and evolution of the machine, each suited to some special purposes; 
We mention here briefly a few which take the instantaneous state of each 
neuron as the object of interest. 

If the number of neurons in the N.E. is N, then a solution of the N. BI 
at time ¢ is representable by means of a one-column matrix with N rowsif 
the element of row / being given by u,(¢); or one can define, ABC 
an N-dimentional configuration (or neuron) space W, which has N axes} 
on the Ath of which the abscissa is u(t). The state at time t of the machine 
is thus represented by the point, or matrix, or vector, P(t) = {u,(t)} = u(t) 
its evolution in time by the (discontinuous) motion of the point P(t). 

All trajectories in 4 are invariant under the transformations of the 
N.E. which belong to the group G defined in B, 2, provided of course tha 
at each time ¢ one takes G as it is determined by the M.E.: now, G = G(2). 

NW contains at most 2” points; a trajectory in Y is a polygonal joiningy 
some, or all, of these points. A reverberation is represented in VY by ag 
closed polygonal (and lasts at least as long as the coefficients in the N. E| 
stay frozen). 

P(t) changes in (i) because it describes the evolution in time of aff 


intervention of the M.E. The A.L.H. permits the study of simple 
phenomena by separating step (i) from step (ii): it allows, that is, tha 
they be performed alternately. Step (ii) becomes necessary as soon as} 


because of M.E., the N.E. undergo a transformation which does not belon 
to G. 
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A qualitative discussion is better stopped here (see, however, Section 
5, A, 2); it should already be clear from what little has been said on this 
subject, though, that the introduction of spaces of functionals on W 
will be of the highest conceptual importance, because then everything 


_ becomes again linear, group theory may be resorted to as a valid tool of 


analysis, each “‘pattern’”’ is easily made to correspond to a point, and prob- 
lems such as those of language translation or study of emotive behavior 


can receive a precise mathematical formulation. 


3. The pattern-analysis described in B, 5, presupposes, clearly, that the 


machine has already formed, either genetically or by learning, some 


typical responses (or modes, or patterns), in terms of which a pattern pre- 
sented to it is analyzed. Very little, if anything at all, can be expected from 
a machine with fixed constants built entirely at random: the most likely 
thing to occur in such a case is that, unless the experimenter arranges the 


connections of the machine in a way that is equivalent to giving it a 
- genetic memory, the only resulting effect will be a total loss of information. 


Also a machine endowed with ability to learn will give, at best, a poor 


performance, unless the controlling devices mentioned above (1) are 
included into it, as we shall soon discuss. The best procedure, or at least by 
far the most economical, appears to be in any case that of borrowing as 


much as possible from anatomical and physiological information. 
Before proceeding further we need say a few words about the kind of 
“input” and “output” which seems appropriate to a machine of this 


nature. The notions of input as “that which comes before” and of output 


as “‘that which comes after’ the machine proper are clearly out of the 


question; we are interested in what occurs at all places and at all times 


in the machine, and a relatively small number of terminal plugs could 
never tell us readily this much. Adequate inputs and outputs are instead 


devices out of which (inputs) an afferent lead goes to each neuron of the 


machine, or into which (outputs) an efferent lead comes out of each 


-meuron; or, more economically, these leads connect input and output 
terminals with a large number of neurons spread throughout the machine, 


and connected with all the other neurons so that no relevant information 
on its behavior is lost. Anatomically, this seems to correspond to some 
regions of the brain stem for the afferents, for instance, to the thalamus. 
It is clear that such a device is what can be best desired for many sorts 
of feed-back operations; to this point we shall return briefly in connection 
with the learning mechanism. 

4. We have excluded, in the discussion made in B, 5, the possibility that 
a stimulus presented at the input may produce catastrophic, or epileptic 
behavior. In any real machine constructed with a very large number of 
elements and connections among these, however, and even with very good 
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engineering and planning, catastrophic behavior is the very first thing to J 
be expected. | 

Even assuming ideal starting conditions, the intervention of the M.E. J] 
will soon change the values of the a/}(t). The maintenance of a rever- | 
beration presupposes that, after its cycle is completed, the same situation — 
repeats identically. Unless the ratios of the numbers and weights (as given _ 
by the a(t) ) of output v. input terminals are kept within very critical 
limits (cf. the operation of a nuclear reactor), the most likely event to 
occur is that the initial situation does not repeat exactly after one cycle of 
reverberation is completed, but more, or fewer neurons are excited than | 
the correct number. In either case a process very similar to a chain reaction 
might take place immediately, that is, a very fast excitation of all neurons, } 
or a very fast extinction of all activity; more generally, totally uncontrol- 
lable phenomena would occur as a rule rather than as an exception. 

Whenever learning is involved, and in any case whenever design is not 
ideally perfect, a controlling mechanism (like the cadmium bars in reactors) 
is necessary to prevent any such possibility. We saw in Section 3, C, 2, that 
we can use, without any loss in the learning capacity of the machine, the 
neuronic thresholds s, for this purpose; this is also, clearly, the best 
choice from a practical point of view. 

Any machine that works according to the N.E. and the M.E. necessitates 
therefore a mechanism which, upon receiving information on its local and 
general activity at a given time, may alter the thresholds s, of the neurons, 
so as to avoid catastrophic conditions at later times. We have described 
here the function of the reticular system of the brain, as was made clear 
by the profound physiological investigations of G. Moruzzi & H. W. J 
Magoun. The existence of such a mechanism will provide the second | 
criterion of stability for our machine. 

It will also be necessary, of course, to resort to the methods of which |f 
Nature avails herself in the brain: most of the a%(o) will be taken as | 
vanishing, in such a way that a neuron be connected through its efferent 
terminals mostly to rather distant neurons. Thus, excitations spread out | 
and tend to stay below the epileptic thresholds; if couplings were only | 
with “nearest neighbours”, epileptic waves would be the only mode of | 
operation of this machine. Such a choice of initial coupling coefficients will 
provide the third criterion of stability. We may include into this the 
action of inhibitory couplings also. 

5. We can now discuss the operation of the machine from the point 
of view of the M.E.; learning and forgetting will play an equally important 
role. The cortex of the brain has many, more or less specialized input and 
output “‘areas’’; we shall refer, generically, only to “input” and “output” 
and refrain from using the current terminology of “sensory”, “motor” 
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and “associative” zones, which presupposes a more detailed structural 
knowledge than is needed for a qualitative discussion. 

We also forgo such obvious things as the convenience that special input 
devices be constructed so as to perform a preliminary analysis of the 
figures, or patterns, “‘shown”’ them (fed into them; we refer, for concrete- 
ness, to a visual input); thus, if a set of homothetic triangles is shown, one 
may require that the input device transmit to the direct input of the 
machine only the image of a standardized triangle, plus an information 
on the value of the homothety parameter. Devices of this sort are not hard 
to conceive as special organs or extensions of the machine itself, to which 
they are linked by additional N.E. with fixed coefficients (learning is un- 
desirable at this level). We refer hereafter only to the direct input of the 
machine, and assume that any such simplification has already been per- 
formed somehow. 

Without a thalamus, the machine can only learn by repetition from habit. 
Suppose it starts as a tabula rasa, i.e. with all couplings having values 
a\(o); suppose also, for the sake of simplicity, that a figure may be pre- 
sented to it any number of times, but each time only for a very short 
duration. Each presentation will stimulate into firing a number of input 
neurons which we assume to be sufficiently large to initiate a collective 
activity, which spreads into the machine as described in A, 1 and B, 3 and 5. 
Unless the collective motion thus induced is strongly favoured by the 
genetic memory (i.e. the aj//(o); this ought to be the case for such things as 
the infant’s sucking reflex), nothing much should happen after the virgin 
machine sees a figure Q, (say, a triangle) for the first time; reverberations 
will be evoked but, because of the low values of the coefficients af}(o), 
a great many pulses are required to cause the firing of each single neuron, 
so that periodic, or nearly periodic, modes may be expected to involve a 
great many neurons and to be quite slow. The activity thus induced will 
be rather diffuse, not yet quite specific; it will last for some time > + 
and the coefficients will start changing slowly because of the M.E. The 
pattern of this motion will be altered as soon as this change requires step 
(ii) as described in 2, p. 220: the increase of the coupling constants will tend 
to favor the creation of pathways through which reverberations are facili- 
tated, i.e. have shorter cycles and require less neurons. Normal biological 
disturbances, or mechanical variations, spontaneous (e.g. due to the 
variations of the coefficients caused by the M.E.) or imposed, e.g. by the 
reticular system, will cause this state of motion eventually to cease; the 
chance that a single direct channel be used as many times as is required by 
the M.E. for a permanent engramming of a facilitation is very low; what 
alterations have been caused by the sight of @, will be slowly forgotten by 


the machine. 


224 BE. R. CAIANIELLO 


Clearly, however, the thing is quite different if @, is shown many times | 
in succession to the machine: then the semi-permanent changes induced _ 


by the M.E. will accumulate, until permanent changes are induced that 


definitely facilitate the “most convenient’’ reverberations evoked by 0,, | 
i.e. those that are quickest and involve the least number of neurons; — 


these we regard as specific to @,. Permanent engramming is favored if the 
intervals between the exposures of @, are made shorter, disfavored other- 
wise; it may however take a very long time (days, or months, or years in an 
animal), or never occur at all: semi-permanent memories may decay very 


slowly indeed, and there may also be more stages, with various decay — 


times, before permanent engramming obtains, than is assumed in the form 
(5) of the M.E. 

It is evident that if a series of random figures is shown in succession 
to the machine, it will learn nothing, except perhaps a response meaning 
only that ‘‘a figure is being shown”’ (see later for a further discussion of this 
fact); but if among these random figures a given one @, is shown 
repeatedly enough, the machine will “learn” only @,, together with the 
general response mentioned before. 

This discussion is manifestly incomplete; diminishing response because 
of assuefaction, for instance, is not considered. The reason we do not wish 
to push it farther is that it appears too easy, rather than too difficult, to 
answer such questions at a qualitative level—which may be as dangerous 
as inconclusive. For instance, both the action of inhibitory couplings 
(which we have arbitrarily disregarded here for the sake of brevity, 
although their importance cannot be doubted) and of external inhibitions, 
e.g. from the reticular system may be invoked to explain assuefaction. 
Even with these restrictions, we hope that the present discussions of the 
subject may suffice to provide a convenient basis for further elaborations. 

6. Learning by repetition, as described in 5 above, does not appear very 


satisfactory. One may say that the virgin machine has no more reason to |) 


wish to learn a triangle than the infant child. The same mechanism, how- 
ever, makes it easy to explain “learning by punishment and reward’’, or 
conditioning of the first kind (as we shall say) and to account why the latter 
is much faster than the first. 

The machine must, of course, be sentient to some degree to know 
whether it is being punished or rewarded. This will mean here that the 


thalamus has in-built criteria (homeostatic devices) which enable it to ] 


“like” or “dislike” what it records. Suppose that this is the case, and that | 
the thalamus can either suppress, or create a state of excitation in the 
neurons. ‘Then, even when a stimulus is presented only once at the input, 


the thalamus may evaluate the situation through its homeostats and] 


determine either a quick suppression of it (or of a part of it, e.g. that | 
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which produces some specific motor pulses), or that it be reinforced and 
maintained until permanent engramming is achieved, much more quickly 
in this way, clearly, than by “‘external’’ repetition at long or random in- 
tervals. 

A thinking machine may perhaps do without this device and method 
of learning, which becomes of utmost importance only when survival has 
to be fought for. 

7. We have discussed in 5 (p. 222) the situation that occurs if a given 
figure, or pattern, 0, is presented repeatedly at the input. We consider 
now what can happen if two distinct patterns ©, and @, are used. 

If the machine has already learned 0,, ie. if it responds to 0, with 
specific reverberations, when @, is presented for the first time the situa- 
tion is not the same as at the origin of time; some facilitations are already 
formed in the connections, so that the machine, as its first reaction, will 
show all of the reverberations which ©, may evoke in common with @,, 
plus extra motions which will slowly be changed into a response distinc- 
tive of @,. In other words, if the machine knows already 9,, and sees 0, 
for the first time, its only immediate response will be to tell how much 
©, has in common with @,; later on it will learn also ©,. Thereafter, it 
will be able to analyze likewise 9, in terms of ©, and @,; and so on. 

This is, at its simplest, the mechanism of pattern-analysis, as it develops 
with the evolution in time of the machine—its “‘education’’. The analysis 
the machine is capable of performing at time ¢ is determined, because of the 
A.L.H., solely by the N.E., taken with the values their coefficients have 
at that time. 

8. The most typical and distinctive characteristic of the human mind 
is, in our opinion, its ability to abstract what is ““common”’ to two, or more, 
situations or patterns, and to retain the result of this operation as a new 
pattern, which is entrusted to the memory as if learnt from the outside. 
Just as pattern-analysis was seen to be the fundamental operation of a 
machine described by N.E. with frozen coefficients, we proceed now to 
show that abstraction is the other fundamental operation performed by a 
machine which obeys N.E. and M.E. as well. The same discussion will 
clarify also the exact meaning to be given to the word ‘“‘common”’ used 
above. 

At any time ¢ intermediate between the time fy at which a pattern 0, 
is activated into the machine (e.g. a triangle @, is shown at the input) 
and the time tf, at which the collective motion aroused by that act dies out 
(t,— t 7; no other such activations are supposed to take place, and the 
mechanism described in 7 above is excluded), there will be m,(t — to) 
neurons which actually fire, and form the constellation (set) c(t — to) 
engendered by the activation of @, at tj. There will be also, in addition, a 
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constellation ci(t — ty) of \(t — to) (in general, n’> n) of “penumbral”’ 
neurons which do not fire, but receive a subliminal facilitation from the 
activation of 0, at fo. | 

Suppose now that at some time to(t) < to < ¢,) another pattern @, is 
activated (a different triangle @, is shown at the input). The machine may _ 
have already learnt @, in a permanent or semi-permanent way, or not. 
Define likewise ,(t — t,) and ¢,(¢ — #4), etc., and neglect (this is incorrect 
and could easily be avoided in this discussion without altering its quali- 
tative conclusions) the alterations caused by the non-linear interference | 
of these motions at times between #, and ¢ in the constellations ¢\(t — to) | 
and c,(t — to); suppose that at least for some ¢ between t, and ¢, the inter- ] 
section of the sets c;(f — ty) and c,(¢ — tj) is not void and contains a con- | 
stellation c,.(t — t)) of neurons which (would receive only subliminal 
stimuli from either motion, but) fire because the non-linear summation 
of the stimuli from both motions exceeds their thresholds: then, c,. cannot 
be distinguished from a state of excitation such as would be produced by jf 
the presentation (not necessarily at the direct input which this, we recall, | 
has the same structure as the rest of the machine) of a pattern 0,5. 
@,., can be aroused, clearly, only if @, and @, are shown at the input, the 
second after the first, and will result quite differently, in general, from | 
@,,, 1f the temporal development of both motions and the interference 
effects between them are treated without the illegitimately oversimplified 
assumptions which were made here for short. | 

Although presented here in mere outline on purpose, this line of reason- |] 
ing makes it evident that, because of the non-linearity of the N.E., when- 


ever a pattern is activated while the response to a previous one has not yet |] 
died out, the machine can abstract something which is ‘“common’’ to both jf 
(the structure of the machine decides what meaning this word should |} 


have) and then adjust itself through the M.E. so as to memorize it, per- | 
manently or not, not differently from its normal behavior in response to | 
any other pattern. 

‘Patterns of patterns” of any sort, in any number, may be formed and 
learnt in this way: chains of abstractions can take place without limitations 
other than those imposed by the complexity and structure of the machine. 
We recognize here, in full, the mechanical analogue to the faculty of 
abstraction of the human mind. 

g. Abstraction alone is not enough: it would be highly uneconomical to 
remember all single instances, once the general concept is grasped. Our 
machine takes, in this respect, good care of itself. Suppose that, say, a 
sufficiently large number a of random triangles @, have been shown to the 
machine, which has learnt them semi-permanently; it has also formed the 
responses 0; 5, O,4, Q,53,---, O13... q= Owhichare common to all sub- 


THOUGHT-PROCESSES AND THINKING MACHINES 227 


sets of triangles (ordered or not) and memorized these semi-permanently. 
This memorization is accompanied by a process of facilitation; whereas, 
before memorization, @ could be evoked only if a// the triangles were shown 
to the machine, after facilitation, the al) that appear in the N.E. which 
describe the behavior of the neurons of the constellation Ce eG 
have increased their values considerably; suppose, for maximum sim- 
plicity, these values to have become so large that each single motion @, 
gives now to the neurons of c, instead of a subliminal stimulus, an excita- 
tion above threshold (if more than one, but less than a, @; were involved 
at this stage, our argument would only take a few more steps). Then each 
time any triangle ©; is presented to the machine, the common response 
@ (which can convey, clearly, only the ‘general concept’’ of triangle) is 
always evoked, i.e. the neurons of the ‘““common”’ constellation c fire and 
their channels are facilitated some more; on the average, the couplings of 
each constellations c; will be susceptible to increase for only 1/a of the 
total time, while those of c will be exposed to facilitating actions for all 
the time—until permanent memory is achieved. Even if at the beginning, 
say, c, had been markedly facilitated, the presentation of more and more 
patterns which have “something in common”’ with it (as decided by the 
machine) will cause the specific response to c, to fade into oblivion, while 
the “‘abstract’’, or ““common”’ response is evoked as the first thing, after a 
convenient learning period. 

If we consider now only two stimuli ©, and ©, and assume that 09, 
causes already, say, because of the genetic structure of the machine, a 
direct response (e.g. the food-salivation reflex of Pavlov’s dogs), while 0, 
is ineffective (cf. bell-ringing), then the same mechanism can be clearly 
extended to account first for the formation of the response @, , (bell before 
food, ~ @,.), then, for the fact that ©, alone comes to provoke the same 
effect as 9, or O,. The temporal behavior given by M.E. of type (5) is 
perfectly suited to describe the available evidence, which might be used 
for a determination of the numerical values of some of the constants which 
appear in (5). 

This type of conditioning is manifestly different from that described in 
6 above, which requires the intervention of the thalamus: we shall call it 
conditioning of the second kind, or “by information” (the bell just tells the 
dog that food is coming). 

10. We discuss, finally, re-integration, which we define as the fact that 
our machine shall, after learning a pattern, respond to the “incomplete” 
presentation of that pattern as if it were complete (cf. the familiar over- 
sights of the proofreader). ‘That this must be the case can be seen now 
quite trivially: after a number of facilitations have occurred, a smaller 
number of input stimuli will be required to cause the firing of neurons 
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than were necessary when that pattern was presented for the first time. } 
This also accounts for the children’s alterations of new words, which are} 
reduced to combinations of already familiar words; in part, for the fact } 
that a cue suffices to evoke a long string of memories, e.g. verses, etc. 


5- Concluding Remarks 


A. TIME EVOLUTION OF PATTERN-ANALYSIS 


1. The preceding discussion has been, at various places, restricted to} 
situations in which a stimulation, or presentation of a pattern, at the input: 
occurs at given instant, after which it ceases while the machine starts its} 
analysis of it; that is, we have chosen to consider only the “free” modes of} 
motion of the machine, of which the stimulation sets the initial conditions, 
rather than the evidently prevalent situations in which the machine will] 
perform ‘‘forced” motions under the continued influence of stimulations; 
which persist and may vary with time. 

| : 


This simplification is obviously convenient for the purposes of a purely} 
qualitative discussion, as it permits separate examination of the varia . 
features of the operation of the machine; nor can forced motions be} 
adequately described without a quantitative analysig of the solutions of the; 
N.E. and M.E. We wish to point out here, however, that this simplifying 
assumption is, in all likelihood, a much better approximation of reality |] 
than it may seem at first. 

We can consider a continued stimulation as the presentation to the 
machine of a time-series of patterns; its analysis by the machine is therefore! 
a serial operation. Our observation is, that a machine such as the one} 
envisaged by us, with a very large number of elements, can actually trans- |} 
form that serial operation into a parallel operation. | 

Let the patterns presented at the input (or anywhere by the machine to. 
itself: cf. Section 4, c, 8) at time 0, 7, 27, 37... be Oy, @,, @, ... . We rent 
call (Section 3, B, 2) that each neuron, as described by the N.E., has a refrac-| | 
tory period which is > 7, and may be > 7; we take it hereto be Rr, according jf] 
to the schematization expressed by (4). @, causes a set S, of neurons to fire at 
time 0; all neurons of Sy then stay dead for Rr sec, while other neurons are} 
excited by them at rather distant. places (3rd criterion of stability, Section} 
4, C, 4) and then remain dead in turn while stimulating other distant) 
neurons, etc. When @, is presented at time 7, all neurons of S, cannot i| 
respond, and another set S, disjoint from S, fire and excite likewise distant 
neurons, etc.; and so on far @, at time 27, etc. 1 | 

All the petteae presented fom t) until ¢ = Rr are therefore registered |] 
in parallel by the machine, which can thus, for example, abstract a con-| 
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cept meaning ‘“‘motion” with the same mechanism by which it abstracts 
one meaning “‘triangularity”’. 

The extent to which this happens is determined, in this simple example, 
by the value of R. If the reverberations which are significant for pattern- 
analysis have periods < Rr, then the simplifying assumption made in 
Section 4 is quite good. In any case, the conversion of serial into parallel 
operation, and vice versa, will emerge as an obvious and important feature 
in any quantitative discussion of the N.E. 

2. It is convenient at this stage to return briefly to the matters men- 
tioned in Section 4, C, 2, so as to summarize the essentials of the operation 
of the machine into a few mathematical concepts. 

Our machine learns, by the process of abstraction (by virtue of the M.E.), 
to perform pattern-analysis (by virtue of the N.E.). This sentence contains 
all that is most relevant in our theory; mathematically, it can be expressed 
as follows. 

At a fixed time ¢, the N.E. have frozen coefficients: A.L.H. A pattern 
@ presented at or into the machine at t evokes, in general, a very large 
number of disjoint modes, or reverberations: this is the ‘“‘Fourier’’ analysis 
performed by the machine, each mode corresponding to a point 2 or an 
axis in an appropriate functional space ¥* on VW (the quotes call to mind 
the profound difference from linear Fourier analysis). 

We may also say that each point 2 of ¥ corresponds to one of the 
basic concept, or words, which the machine has learnt until t: the N.E. 
contain, implicitly, all the knowledge or vocabulary of the machine, in 
terms of which each new pattern is translated by the machine. (This 
vocabulary we expect to have a surprisingly different structure from those 
of Western languages, in which a word is a “point”, and a sentence or 
definition is a “‘surface constructed point-by-point”; more akin to Chinese 
or Japanese, in which a word is a “plane” and a sentence or definition a 
“surface constructed as envelope of planes’’.) 

The second difference from Fourier analysis (the first being non- 
linearity) is that now the set of fundamental modes, or axes in F, is not 
constant, but changes in time because of the M.E. Thus, at a given ¢ a 
pattern is represented by a point in a given frame S in F (N.E.); this 
frame, however, is not fixed, but changes slowly (A.L.H.) in time (M.E.): 
S = S(z). 

This scheme, if correct, has profound implications: for instance, the 
efficiency of the machine will depend tremendously on the method followed 
in its education, because to the same external stimulus machines which 
have been educated differently may offer very quick and simple, or very 
slow and involved responses. 

3. An interesting consequence of our theory is the fact that a machine 
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thus constructed necessitates periods of rest, or “‘sleep’’. Indeed, even | 
giving it a tremendously large number of elements, the long duration of | 
reverberations (and induced motions in general), which goes far beyond } 
that of the actual stimulations, will cause, if stimuli are unceasingly offered 
to it, an ever increasing cumulation of activity; the possibility of co- 
existence of disjoint reverberations will become less and less, interference 
will cause ‘‘confusion of ideas” and inability to give correct answers even 
to familiar questions. 

A period of “‘sleep”’, i.e. cessation of activity through the suave inter- J) 
action of the reticular system, will permit the gradual extinction first of 
the less facilitated, then of all other reverberations, and the fading of semi- 
permanent memories of relatively short duration as well. A more quick 
and drastic treatment, such as “‘electroshock’’, followed by total quiescence 
for only the period of time which is required for such semi-permanent 
memories to disappear, will produce the same effect. This reminds one of 
some Yogi techniques which are said to achieve, in a relatively short time, 
the same state of rest which follows a full night’s sleep. 

4. As a final remark, we wish to point out that it might be interesting, 
in the light of the present considerations, to attempt an analysis of E.E.G.’s 
in terms of subharmonics instead of the customary one in terms of har- 
monics of a suitably chosen frequency. The amplitude of the E.E.G. 
recordings should depend strongly on the mechanism described in 5, A, I. 


B. SELF-ORGANIZATION INTO RELIABLE OPERATION 


1. In the study of any system containing a very large number of inter- 
acting elements built with realistic tolerances—let this be the machine 
which is being discussed here, or a bee-hive, or the whole socio-political 
and administrative framework of a nation—the central question is cer-’ 
tainly whether this system obeys instantaneous and evolutionary laws 
which guarantee its spontaneous convergence in time towards more efficient | 
operation, or whether the system may rather show erratic or divergent’ 
performance. This question was formulated with full clarity by|f 
N. Wiener, who, besides emphasizing the vital importance of it, gave also |ff 
powerful mathematical tools for its investigation; in his treatment the} 
non-linearity of the interaction laws was rightly stressed as the key to the 
whole problem. 

We can do nothing better than refer the reader to his work for a deeper 
elaboration of these ideas; it is important, however, here, to show that 
there is very satisfactory qualitative evidence that a machine obeying 
suitable N.E. and M.E. will satisfy Wiener’s principle of self-organization. | 
We shall keep this discussion at a qualitative level by resorting to physical 
more than to mathematical arguments, and by examining later in detail, 
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instead of the actual machine proposed here, a simplified functional model 
of it. 

That non-linearity of some sort is necessary for self-organization is 
physically obvious. A linear system (with frozen coefficients, or else non- 
linearity intervenes: cf. a, 2 above) cannot perform a spontaneous transition 
from one of its states to another (we use the language of quantum 
mechanics only because of its greater appeal to intuition; of course, the 
same is true classically); such transitions—without which the system could 
not choose spontaneously, from our point of view, states with a better (or 
worse) organization—are only possible if there are perturbations to cause 
them (e.g. the interaction with the electromagnetic field causes the quan- 
tum jumps in atoms); furthermore, the system itself must originate these 
perturbations (the electrons of the atom are the source of the electro- 
magnetic field in spontaneous emission), or else its changes of state would 
be “induced” rather than “spontaneous”. The equations of the system 
must therefore be non-linear in an essential way; such, for instance, as we 
have in electrodynamics when the electromagnetic field is expressed in 
terms of electron sources. 

Only a non-linear system of this sort can change its state spon- 
taneously; it will not be generally true, however, that its changes are 
necessarily “‘for the better”. Again, simple examples suffice to prove this 
statement; leaving aside those, plentiful indeed, which come from societies 
or civilizations that go bankrupt, we observe that the behavior of a non- 
linear physical system the energy of which is not restricted by a finite 
lower bound is certainly catastrophic. We expect therefore that such a 
system, in order to satisfy Wiener’s principle, shall obey additional 
necessary ‘‘convergence”’ conditions—of the type, for instance, that in 
quantum mechanics secures the existence of a ground state, which we may 
identify with the “‘best state for efficient operation”’. It is our belief that 
these conditions will amount to satisfying the third criterion of stability 
(Section 4, c, 4) and to choosing N.E. with the qualitative behavior 
exhibited by (5). 

The study of the restrictions that this criterion would impose on N.E. 
(2) and M.E. (5) appears to be a rather straightforward problem, which 
we add to the list of those which we formulated and set aside in Section 4 
for future investigation. A dynamical interpretation of (2) and (5) would be 
indeed quite natural, as the M.E. (5) just express a non-linear coupling of 
the N.E. (2) with themselves; we deem it more meritorious, at this stage, 
to resist the temptation of adapting the available quantum-field-theoretical 
knowledge to these problems, than to yield to it. 

Clearly, the A.L.H. imposes the distinction between two types of non- 
inearity: that which is expressed by the N.E. (2) and that which is 
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expressed by the M.E. (5). The first is useful, but only the second is} 
necessary for the type of self-organization we are discussing; this state-} 
ment will appear obvious after the discussion in 2 below, which is dedicated, | 
as was announced, to a simplified version of our problem. 

2. We wish to discuss here a model of our machine (which we may also: 
regard as another, though less sophisticated, model of the brain; as such 1 
was briefly discussed at an early stage of our work) which consists in a 
system of N linearly coupled harmonic oscillators, the constants of whic 
obey M.E. of type (5) (r = 0), under the A.L.H. This was mentioned in: 
Section 4, B, 3, and is a model of our machine in the sense that, as was: | 
said there, ‘‘reverberations” correspond in it to normal modes, “sub 
harmonics” to ordinary harmonics, etc. The wealth of solutions of N.E. off) 
type (2) is now lost, because of the linearity of the N.E. of this model; 
for it the central problem reduces to the determination of the solutions o 
a secular equation of degree N: 


Ks A) = 0 (6) 
This digression is useful both because it readily provides a qualitative 
insight into the behavior of a learning machine with respect to self 
organization, and because it shows that many different mechanisms ma 
be built, with various degrees of convenience, to produce “thought” 
the only essential thing is that they obey some N.E., some M.E. (suitableq 
but not necessarily of type (2) and (5) ), and closely enough the A.L.H 
We wish to compute the average change < 5A > of a solution X of (6). 
when the coefficients a,, of the equations of the system undergo infinite4ff 
simal random variations (from a “‘macroscopic” point of view, sal 
variations due to learning will appear as ‘‘random”’): 


Any > Any(l + pax) = Ay + Sanz G 
suchithat?<<'p,.) > =="p. 
From: 
Giz + 84x; A+ oy) 
of of . | 
oe QnkPhk Oayp + ih =| dA =O (8 i 


we find, on taking averages (since (6) is homogeneous of degree zero in} 
the term AY and of degree one in all other terms, with respect to thef 
variables a,,): | 
(X)¥ 
which tells us several interesting things. It requires, first of all, th 


f(A) #09, or else X would be a degenerate eigenvalue, and even wi 


<A>= 


oF 
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p = 0 the degeneracy might be removed just the same by (7). We suppose 
therefore f’(A) 0; to gain some insight into the behavior of < 5\ > we 


just suppose here that the roots of (6) are all simple and equally spaced, 
so that 


NS RAG (R= ti 3h. ON) (10) 
then (9) gives: 
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For our qualitative purposes, (12) and (13) suffice amply to show that 


random variations, due to learning as expressed by M.E. of type (5), may 


alter by a vanishing amount the smaller, by increasingly relevant amounts 


the larger eigenfrequencies of the normal modes of this model. 


If a frequency which is kept long enough unchanged becomes perma- 
nent, this model will evolve therefore with learning so as to preserve the 
modes with small frequency; the modes with higher frequency, as well as 
those that correspond to degenerate solutions, will change at random 


without staying long enough at any given value to become permanent. 


t 


After a “long” time, the system will have shifted spontaneously, because 
of its learning ability, toward a “ground state’’, in which there is no 


| degeneracy, but the allowed frequencies stay as close to one another as the 


maximum and minimum values conceded by the M.E. to the constants 
will permit; this is the ‘“‘senile age” of the system, in which no learning is 
possible because all available memories are already engrammed perma- 
nently (cf. (7): all 5a, = 0). The “infancy” of the system is characterized 


instead by p > 0, because for t > 0 (when all the a,,(0) have minimum 


absolute values) they can only increase monotonically: learning is some- 


what slower, degeneracies are removed faster. The “adult” age corre- 
sponds to the period in which it is mostly 6a,, #0, p = 0. 
This cursory and incomplete glance at the properties of this model is 


intended only to show in which sense we should expect Wiener’s principle 


p 


| 


t 
j 


to be verified by it: the ‘“‘reliable’”’ information is that carried by the small 
eigenfrequencies; when the erratic behavior of the higher frequencies 


pushes one of them down enough, it may be permanently “engrammed”’, 
increasing thus the reliability of the system. 
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3. The main differences between the machine we study and the model J 
of it discussed in 2 above lie, as regards the validity of Wiener’s principle, 
in the non-linearity of the N.E. (2) and in the 2nd criterion of stability | 
(Section 4, c, 4). The situation is made worse to some extent than in (2) | 
by this non-linearity; in any case, though, by assuming the number of — 
elements to be large enough, one ought to obtain a preferential decrease | 
of the effect of random learning variations on the faster reverberations (to | 
higher frequencies of the model described in 2 slower reverberations of 
the machine now correspond), so as to reproduce, although perhaps less 
dramatically, a situation like that represented in 2 by (12) and (13). | 

The existence of a thalamus and of a reticular system (2nd criterion of | 
stability) makes instead a tremendous improvement upon the model dis- | 
cussed in 2, which could still satisfy Wiener’s principle even though it is not 
endowed with these homeostatic devices. Conditioning of the first kind 
and arousal of attention by the thalamus, prevention of too diffuse (and 
therefore not very meaningful) reverberations by the reticular system, are 
only examples, clearly, of what controlling devices of such effectiveness 
may do in the way of forcing the machine into learning important informa- 
tion in a reliable manner. These devices are especially important if the 
memory is prevalently of negative, rather than of positive type as is 
assumed here (Section 3, C, 1), because then the machine would have, in 
its infancy, a tendency towards epilepsy. 

Of importance in a discussion of this principle are also, clearly, all the 
additional special-purpose devices that Nature uses. The machine also 
is better with these devices, as intermediate links between external inputs 
and outputs, and inputs and outputs to the machine proper or cortex; 
their consideration, however, belongs to engineering more than to physics, | 
and would not be relevant at this place. In conclusion, we should like to | 
stress once more our firm conviction that the speediest way to progress in | 
all the problems connected with the actual construction of machines of this |} 
sort is humble resort to Nature’s own doings, through neuroanatomy and | 


physiology. 


C. FURTHER OUTLOOKS 


As a final comment, we think it appropriate to remark that the general } 
formalism of N.E., M.E. and A.L.H. which is expounded here seems to us J 
to admit of a far wider range of applicability than that to which it has been J} 


restricted in this work. The N.E. in fact, for instance, serve only to express, |} 


’ 


in a more or less schematic manner, the fact that a decision is taken, after 
a weighted evaluation of the information which lasts a finite time, by a 
member of a set; and that such a decision is bound to affect other decisions, |] 
etc. We may change their name into that of decision equations, call the |} 
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M.E. evolution equations, and take all our considerations over to the study 
of social or economical or other collective phenomena. 
We have pursued this line of thought in several directions for personal 
amusement, and have soon found, to our surprise, that the qualitative 
analysis given here for thought-processes applies as well, mutatis mutandis 
(that is, names), to a great many other instances. We believe that, as soon 
as it becomes possible to agree on a concrete choice of schemes and 
numbers, quite reasonable predictions may be made in this way about, say, 
the operation of a stock-exchange, the variation in time of a parameter in 

feminine fashion, the type of national government that would best obey 
-Wiener’s principle, and so on. This we say with at least the same degree of 
assurance that we have found in the economists who apply the Schrédinger 
equation to the study of their problems. 

Although we have refrained here from a quantitative analysis of the 

several mathematical problems formulated in the course of this work, 

the results we have already obtained in this direction seem to justify some 
optimism; if these expectations are not illusory, then the present formalism 
might help us to gain a finer knowledge of some physical phenomena 
that can now be treated only with statistical methods. 

This research would not have been possible without the generous and enthu- 

-siastic collaboration of Dr. V. Braitenberg, neuroanatomist, among whose merits 
were certainly not least patience toward this writer’s initial ignorance and pre- 
sumption, and success in eliminating the latter; of Dr. F. Lauria, a young mathe- 
matician who dared to place mathematics not too high above common brains; 
and of many others, to all of whom it is our duty and pleasure to extend our 
sincerest thanks. 
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Aldosterone is the most important of known sodium-regulating hormones, } 
although it is responsible for only a few per cent of total renal sodium re- J 
absorption. In addition, aldosterone can increase renal excretion of 
hydrogen and potassium ions. That its renal effects are direct was first |} 
demonstrated by Barger, Berlin & Tulenko (1958) by unilateral renal artery | 
injection of the hormone into normal and adrenalectomized dogs. More- 
over, these investigators were able to demonstrate a complete dissociation | 
of aldosterone’s kaluretic and anti-natriuretic effects in the normal dog. | 
Ganong & Mulrow (1958), using a similar technic, have also found a sig- J 


nificant separation of these effects in the adrenalectomized dog. Thus the 
possibility that aldosterone accelerates transport systems exchanging 
sodium for hydrogen and potassium remains an unresolved question. 

Recently (Vander et al., 1958; 1960), with the use of stop flow analysis 
technic (Malvin, Wilde & Sullivan, 1958), a distal site of action of aldo- 
sterone has been demonstrated. It is the purpose of this paper to describe 
a model for sodium transport and aldosterone activity based upon these 
findings. 


Site of Action of Aldosterone 


By administration of an osmotic diuretic (mannitol) to the dog, very |] 
high rates of urine flow can be established. If during this period the ureter | 
is clamped, intratubular pressure rises to equal net filtration pressure, at | ; 
which point glomerular filtration ceases. During this period of stopped. 
ureteral flow, the concentration of any substance in the intratubular fluid |] 
column will be changed along the nephron, depending upon how the | 
individual segments handle this substance. The mannitol retards water | 
reabsorption and provides a menstruum against which electrolyte con- 
centrations may be changed. Upon release of occlusion, these concentra- 


+ Supported by the American Heart Association, 57-G-157; the Life Insurance |i} 
Medical Research Fund, G-59-44; the National Institute of Arthritis and Metabolic 
Diseases, U.S. Public Health Service, A-1740-C3S1 Physiology. 
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tion patterns are obtained in approximately 0-5 ml urine samples which 
segment this pattern into an ordered array, best pictured on a graph if 
concentration of each sample is plotted against accumulative urine volume. 
Thus the dashed curve in Fig. 1 is a typical normal sodium concentration 
pattern, demonstrating a fall in sodium concentration as fluid trapped in the 
distal tubules during occlusion enters the urine collector after release of 
occlusion. The line designating PAH maximum indicates the collection of 
fluid trapped primarily in the proximal tubules during occlusion. 

It is evident from Fig. 1 that, after adrenalectomy, the distal tubule was 
not able to reduce sodium concentration to the minimum value achieved 
during ureteral occlusion in normal dogs. The final concentration achieved 
was not altered by prolonging the length of occlusion. 
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Fic. 1. Comparison of distal tubular sodium reabsorption during ureteral occlusion in 
a normal and in an adrenalectomized dog (Vander et al., 1958: by courtesy of the 
Society for Experimental Biology and Medicine). 


Also of significance was the finding (Fig. 2) that the minimum distal 
stop flow sodium concentration was a function of the plasma sodium con- 
centration in the adrenalectomized but not the normal dogs. In the former 
group, as plasma sodium rose (this was effected by injection of hypertonic 
NaCl between occlusions), the minimal sodium concentration attained 
during ureteral occlusion also rose. This indicates that adrenalectomy has 
reduced the maximal sodium concentration gradient which could be main- 
tained across distal tubular cells. Identical results were obtained by 
administration of the steroidal antagonist SC-8109 to normal dogs. Finally, 
all the above abnormalities could be corrected by the administration of 


aldosterone. 
16—2 
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Fic. 2. Relationship between plasma sodium concentration and the minimal distal 
urinary sodium concentration developed during ureteral occlusion in normal and 
adrenalectomized dogs. Lines connect points derived from a single dog as plasma sodium 


was elevated (Vander et al., 1960: by courtesy of the Society for Experimental Biology | 


and Medicine). 


Mode of Action of Aldosterone 


Conventional clearance methods indicate that aldosterone increases 
tubular reabsorption of sodium. This action is manifested in stop flow 
analysis by the ability of aldosterone to increase the maximal sodium 
concentration gradient which can be developed between plasma and distal 
tubular urine. Since the minimal distal sodium concentration attained 


during ureteral occlusion is independent of the duration of occlusion, this | 


concentration must be an equilibrium value, i.e. the concentration at 
which sodium movement out of the lumen is equal to sodium movement 


inward. Aldosterone could act, therefore, in one of two different ways: | 


it could activate carrier systems responsible for sodium transport outward 


or it might decrease passive back-diffusion of sodium from interstitial | 


fluid into distal tubular lumen. 


A rough estimate of such back-diffusion of Na is being made in our | 


laboratory, using the isotope *4Na. The ureteral occlusion stops filtration 


so that glomerular substances such as inulin will not enter the concentra- | 
tion pattern except as new filtrate. *4Na injected intravenously after the. 
occlusion, by crossing the tubular epithelium transmurally, enters the 
urinary pattern ahead of inulin. Precise delineation of rates of movement | 


into the stop flow pattern is complicated by the continuing decline of the 
precursor *4Na in the blood plasma during the short two-minute period 


allowed for the *4Na to enter the tubule lumen. The shape of the stop flow. 


i 
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curves for *4Na seems unchanged after adrenalectomy or SC- giog with 
no suggestion of any increased rate of back-diffusion. 


Effect on Kinetics of Active Transport 


As mentioned above, the length of time of occlusion beyond 3 to 4 
minutes does not alter the final sodium concentration achieved by the 
distal tubule during occlusion, indicating that this concentration repre- 
sents an equilibrium value at which movement of sodium outward is 
equal to its movement inward. This fact allows us to develop the following 
equations, which can be used to characterize, subject to the limitations 
described below, not only the carrier systems responsible for active sodium 
transport but also the influence aldosterone exerts upon them. 

Dp» = K Na, (1) 

Digg = Nay (2) 
where D = rate of passive diffusion of sodium inward or outward; 
K = diffusion constant for Na through distal tubular cells in either 
direction; Na, = plasma sodium concentration; Na; = minimum distal 
tubular sodium concentration attained during ureteral occlusion. 

It must be pointed out, however, that the rate of these passive fluxes 
will be determined not only by the diffusion constant but also by the trans- 
tubular electrical potential gradient. Since this electrical gradient will 
oppose sodium flux in one direction and favor it in the opposite, depending 
upon the orientation of the charge, it cannot be incorporated into the para- 
meter (K). Giebisch (1958) using the technique of micropuncture, has made 
four determinations (27, 37, 39, 40 mV, inside negative to outside) of 
distal transtubular potential in Necturus. Interpolation of these data to the 
dog under conditions of stop flow would be unwarranted. Furthermore, 
since the origin of the potential is unknown, it is not possible to predict 
the effects of increasing plasma sodium concentration upon it. Because 
of the impossibility, at present, of incorporating the electrical potential 
into the equations below, the authors believe that it would still be of value 
to attempt to set up a working hypothesis, ignoring this factor, and sub- 
ject always to question and modification. 

_ If ry, = rate of active reabsorption of sodium, then at equilibrium, 


K Na, = K Na, + rya 

| K(Na, — Nay) = Tye (3) 

The rate, ry,, may be described by an equation analogous to the Michaelis- 
Menten equation of enzyme kinetics: 

Rya Na, 


tna = Na fe Ky; (4) 
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Where Ry, = maximal or saturation rate of distal tubular sodium re- 
absorption; Na; is as above; Ky = that sodium concentration in the 
distal lumen at which the velocity of active outward sodium transport, 
in analogy to the Michaelis-Menten constant of enzyme kinetics, is equal 
to 4 the maximal attainable velocity. Combining equations (3) and (4): 

Ry, Na, (5) 
Na, + Ky : 

In this equation, K, Ry,, and Ky are all unknown. However, Na; and 
Na, can be determined for two different steady state concentration 
gradients, in two separate occlusions on the same dog, in one of which 
Na, and Na, are altered by elevating Na,, resulting in an elevation of Na,. 
These values can then be substituted separately into equation (5) to yield | 
two equations. Divide one equation by the other: 


K(Naj — Naf) — Ry, Naf / Rw, Naj (6) 
K(Na? — Na?) Naf + Ky/ Na?+ Ky 
in which the superscripts a and b designate sodium concentrations at two | 
different equilibria in the same animal. Since it is necessary that Na, vary 


K(Na, — Na) = 


with Na,, such an equation can be obtained only in experiments on 
adrenalectomized or SC-810g treated dogs. In the normal animal, this 
cannot be done, distal sodium concentrations being essentially unchanged 


TABLE 1 


Values of Ky tn adrenalectomized and SC-8109 treated dogs 


Ky, mmoles/r 
SC-8109 Adrenalectomized 


wWhun 
WwW nw 
eH N DN Pp 
ArIMnasUMH 


See text for definition of Ky. 


at all plasma levels of sodium (very small changes, which very likely did 
occur, cannot be detected by stop flow analysis). The constants K and | 
Ry, will cancel out and the equation may then be solved for Ky: 
Nai — Naf _ Naf(Na? + Ky) | 
Na? — Na? Na®(Na? + Ky) 7 
Table 1 lists values of Ky, calculated for adrenalectomized and SC-8109 | 
treated dogs. Evidence to be presented below indicates that Ky, is identical | 
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in intact animals. It can be seen that Ky, is relatively constant in different 
animals and is quite low. This indicates that distal sodium reabsorptive 
systems operate at near maximal velocity over a wide range of intratubular 
sodium concentrations. 

The determination of Ky, as described above, depends upon the basic 
assumption that the Michaelis-Menten equations developed do actually 
apply under the existing experimental conditions. However, it is not at first 
clear how these might be tested. Absolute velocities cannot be determined, 
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Fic. 3. “‘Velocity-substrate”’ plot of data obtained in an SC-8109 treated dog by ele- 
vating plasma Na. Na, = plasma Na; Na; = distal urinary Na concentration developed 
during ureteral occlusion. (Na, — Naj) is the proportional velocity of Na reabsorption. 


See text for discussion of Ky. 


nor can they be calculated since a value for the diffusion constant, K, in 
equation (3) is unknown. Although it would be interesting to evaluate K, 
the validity of fitting the data to these equations can be determined by 
another device. This fact can be appreciated by referring back to equation 
(3). It is important to note again that this equation applies only at equi- 
librium. It is evident that for any given sodium concentrations in plasma 
and distal tubular fluid, velocity of sodium reabsorption will be propor- 
tional to the difference between the concentrations (Na, — Naj). If the 
difference were to be increased by a factor of two, then the relative equi- 
librium velocity would also be doubled. It becomes evident, therefore, 
that these concentration differences can be used to represent transport 
velocities at each of the distal tubular sodium concentrations obtained 
within the same animal. Different dogs can be compared in this manner 
only if it is assumed that the diffusion constant, K, is identical in all 
animals. 

Data obtained for an SC-8109 treated animal in which distal tubular 
sodium concentration achieved during stop flow was changed by increasing 
plasma sodium concentration are shown as a reciprocal plot in Fig. 3. 
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These “‘velocity-substrate”’ plots indicate that the data really fit a Michaelis- 


Menten type analysis. The intercept on the vertical axis in Fig. 3 is equal 


to the reciprocal of the maximal attainable transport velocity, 1/Ry,. If the 


line is extended to the left, the intercept on the horizontal axis is equal to _ 
— 1/Ky, thus providing a graphical method by which Ky can be deter- | 


mined. It is evident that this plot of reciprocals is merely a repetition of | 


the use of equation (7) and will yield the same values for Ky, but only if | 


the data really are linear, i.e. fit the proposed analysis. All values for Ky, 


(see Table 1) derived by equation (7) are identical to those determined — 


independently by this graphical means. The fact that the data are linear 
may further indicate that the possible error introduced by omitting 
electrical potentials may not significantly alter results. 
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Fic. 4. Effect of aldosterone on carrier systems responsible for distal sodium reabsorp- 
tion in an adrenalectomized dog. Nay = plasma Na; Nai = distal urinary Na concen- 


tration developed during ureteral occlusion. (Nay, — Naj) is the proportional velocity of | 


sodium reabsorption. See text for discussion of Ky. 


Figure 4 demonstrates the action of aldosterone upon distal sodium 
reabsorptive systems, as revealed by these graphical methods. Three 
occlusions were performed in an adrenalectomized dog, increasing plasma 
sodium after each occlusion. After completion of the third occlusion, 6 pg 


aldosterone were administered intravenously over 5 minutes and 1 yg/hr | 
was incorporated into the normal infusion. Three occlusions were then | 


performed at 15 minute intervals after first waiting 2 hr 10 min. Plasma 
sodium was again elevated by salt injection between occlusions. Aldo- 
sterone dosage was kept at this level so that it permitted a degree of sodium 
elevation in the distal tubule as plasma sodium was elevated. As already 
described, this is extremely difficult to do in a normal animal or in an 
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adrenalectomized animal given large amounts of aldosterone. It should 
be pointed out that comparison depends upon the assumption that the 
diffusion constant, K, is not changed by aldosterone, an assumption be- 
lieved to be valid on the basis of the isotope studies described earlier. 

From the data shown in Fig. 4, values for Ky, before and after 
aldosterone, can either be calculated using equations described above, or 
graphically determined by extending the lines to the left and measuring 
the intercepts on the horizontal axis —1/Ky. It is this constant, a measure 
of the degree of association between carrier and sodium (in analogy to the 
affinity of an enzyme for its substrate) that would be altered by a competi- 
tive inhibitor or a coupling activator, but it can be seen that K,, is essen- 
tially unchanged by aldosterone. 

Without changing this intercept on the horizontal axis, aldosterone has 
decreased the intercept on the vertical axis 1/Ry,, so that the entire 
“velocity-substrate” line lies under that determined for the untreated 
adrenalectomized dog. Thus, aldosterone, without altering K,,, has in- 
creased transport velocity at each substrate concentration as well as having 
increased the maximal transport velocity attainable at infinite substrate 
concentration. This is the pattern proposed for a “‘non-coupling”’ activator 
(Friedenwald & Maengwyn-Davies, 1954). Stated more simply, aldo- 
sterone does not change the characteristics of the carrier systems, but in- 
creases the effective concentration of these carriers available for sodium 
reabsorption. 

Although the data fit the Michaelis-Menten equation, this does not 
exclude the possibility that some other type of transport system geared to 
the saturation principle is involved. However, since there is no direct 
evidence bearing on the exact nature of the Na transport system, one can 
only speculate as to possible reactions involved. The above analysis merely 
presents a model for Na transport and describes the influence of 
aldosterone upon that system. 
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Some consequences of the hypothesis that ribonucleic acid codes amino 
acid sequences with a coding ratio of one are discussed. The hypothesis 
predicts certain limitations on amino acid replacements in proteins as a 
result of mutation. Available evidence is reviewed and found to conform 
with prediction. 


Recently I reviewed evidence indicating that information determining the: 
sequence of amino acid residues in viral protein is coded in viral ribo-| 
nucleic acid with a coding ratio of one, so that one nucleotide corresponds | 
to and determines one amino acid residue. Part of the evidence for this: 
hypothesis is that amino acids can be assigned to nucleotides in such a/ 
manner that the mole fraction of each group of amino acids in viral protein | 
is equal to the mole fraction of the corresponding nucleotide in the viral 
ribonucleic acid. It is thus possible to predict the composition of viral 
ribonucleic acid from knowledge of the composition of the corresponding 
viral protein. Assignments producing this result are as follows: Adenylic | 
acid: glutamic acid, glutamine, glycine, leucine, phenylalanine and 
tryptophan. Uridylic acid: aspartic acid, asparagine, isoleucine, histidine } 
and serine. Guanylic acid: alanine, arginine, tyrosine and valine. Cytidylic 
acid: cysteine, lysine, methionine, proline, and threonine (Yéas, 1960). 

It is the purpose of this paper to develop certain deductions of the 
hypothesis as they bear on mutations affecting protein structure, and to 
show that these deductions are supported by empirical evidence. 

A coding ratio of one evidently implies that viral ribonucleic acid 
possesses only part of the information required to specify a protein, since) 
each nucleotide limits, but does not finally determine, the choice of a 
residue at any given position. Supplementary information must be ; 
contributed by some structure in the host. As illustrated in Fig. 1, the | 
complete information bearing structure is assumed to be composed of two. 


t For evidence against the supposition that neighboring nucleotides provide additional 
information see Yéas, 1958. 
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components: a, here viral ribonucleic acid, and 8. The 8 component is as 
yet chemically unspecified and may itself be’ a compound structure. On 
general biological grounds it seems reasonable to postulate that, if this is 
correct, such a scheme may apply generally: a 8 and a non-viral a com- 
ponent being always present in the cell for purposes of normal protein 
synthesis. 


> en a} ae: 
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_ Fic. 1. Illustration of the concept that the material specifying the amino acid sequence 
in proteins is a double structure. An amino acid at any position is completely specified by 
a nucleotide in RNA and an element X in the 8 component. 


Consider now the result of a mutation in the a or 8 component. (The 
term “mutation” is here used in the sense of any inherited change, what- 
ever its nature.) To change the a component, at least one nucleotide must 
be replaced by another of a different kind, and the amino acid previously 
specified at that position must therefore be replaced by one belonging to 
a group specified by a different nucleotide. For example, if the original 
nucleotide was adenylic acid, the new amino acid will be one specified by 
-cytidylic, guanylic or uridylic acid, but not by adenylic acid. 
If, on the other hand, a change is produced in the 8 component, the 
new amino acid will belong to the group determined by the same nucleotide 
as the one previously occupying that position, since the nucleotide in the 
a component remains the same. Thus replacements prohibited as a result 
of ‘‘a mutation”’ are the only ones permitted as a result of “8 mutation’, 
and vice versa. Since the assignments of amino acids to nucleotides are, 
by hypothesis, known, these predictions are quite definite and can, in 
principle, be checked by observation. 

In the case of viruses, it is possible to be certain that all mutations are 
a mutations, since different mutants, i.e. a components, can be grown in 
the same host and thus produce proteins with information contributed by 
the same 8 component. All amino acid replacements in viral protein 
resulting from mutation must therefore involve amino acids assigned to 
different nucleotides. The evidence available does not contradict this 
requirement (Table 1), but unfortunately it is as yet too meager to be 
conclusive. 

More evidence is available on replacements involving non-viral proteins. 
As in these cases it is not possible to separate experimentally the a and 
8 components, it might seem hopeless, at first sight, to observe separately 
the processes of a and f mutations. However, there remains the possi- 
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bility that the a and 8 mutation rates are significantly different, and thus | 


could be distinguished on this basis. 
Suppose, as an example, that a mutation is the more frequent. Then 
replacements arising during some short period of time and differentiating 


originally identical proteins would be mainly of a type prohibited to B | 


mutation. However, if the period of observation were lengthened, f 
mutations would begin to accumulate, so that eventually observed differ- 
ences between two originally identical proteins would involve replace- 


ments which were randomly distributed. Providing only that the rate of | 


8 mutation were not zero, randomization of differences would always 
result. The same result would be eventually attained if 8 mutation were 


more frequent, with, of course, an initial preponderance of differences | 


prohibited by a mutation. 

Thus, observation of proteins originally identical and differentiating for 
varying lengths of time should make it possible to determine if the dis- 
tinction between a and 8 mutations is real, providing that the rates of these 
two types of mutations differ. 

Because of technical difficulties, replacements arising as a result of 
observed mutations are not known. However, a considerable body of 
evidence on sequences of amino acids in homologous proteins from 
different species exists (Table 1). Assuming the truth of the theory of 
evolution, hereditary differences must be assumed to arise as a result of 
mutations. The taxonomic distance between species can be used to measure, 
roughly, the length of time the proteins of two species have been diverging 
from a common ancestor, so that it is also possible to classify homologous 
proteins as “recently” and “‘anciently” divergent. As the rate of change of 
proteins appears to be very slow “‘recent”’ and “‘ancient’’ is to be measured 
in units of geological, rather than ordinary time. 

Before comparing protein sequences, it is necessary to clarify the 


meaning of the term “‘homologous’’. In the usual sense it refers to proteins | 
of the same general structure and function in different species that have | 
identical sequences except at a few positions, e.g. insulin from man and | 
pig. Clearcut cases of this type will be referred to as class A homology. | 


However, some proteins, while obviously homologous in the sense of 
having similar sequences, do not exactly fit this definition. For example, 
oxytocin and vasopressin have seven of nine amino acids identical, but 


occur in one organism and have different functions. The same is true of the | 
corticotropins, two forms of which are found in the pig, both similar and | 


having an amino acid sequence closely resembling the melanophore- 
stimulating hormones. These instances will be regarded, rather arbitrarily, 
as class A homologous for present purposes. 


A homology can also exist between different chains of a single protein 
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Fic. 2. Distribution of replacements differentiating “recently”? (a) and “‘anciently” (6) divergent proteins, compiled from Table 1. 
Hatched areas are replacements prohibited as a result of a mutation. AD (adenylic acid), UR (uridylic acid), GU (guanylic acid) and CY 


(cytidylic acid) are nucleotides specifying the amino acids shown to the left. Amino acid abbreviations as in my 1958 paper except 


Gln—Glutamine, Asn—Asparagine, Ilu—Isoleucine. 
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(class B). In a previous publication in collaboration with Gamow and 
Rich (Gamow, Rich & Yéas, 1956) I have drawn attention to the similarity 
between the two chains of insulin, as well as to the tendency for the end 
groups of different chains of single proteins to be identical (Y¢as, 1958). 
As an explanation it was suggested that the several chains were originally | 
one protein specified by a single genetic locus. By reduplication of genetic 
material the chains began to be controlled by different loci, which gradu- 
ally diverged by genetic drift. At that time the evidence for this suggestion 
was slight, but the recent work of Braunitzer, Liebold, Miller & Rudloff 
(1960b) has, at least in part, vindicated this position. They have shown 
that the a and £ chains of human hemoglobin are so similar that a common 
origin is clearly indicated. As the hemoglobin of the lamprey, presumably 
representative of the cyclostome ancestor of higher vertebrates, is single 
chained (Allison et al., 1960), the two chains of hemoglobin must have 
begun to diverge since about the Devonian or Silurian period, and are an | 
example of homologous proteins long divergent. Since fish insulins are also | 
two-chain proteins (Yamamoto, Kotaki, Okuyama & Satake, 1960), the 
chains must have diverged at a comparable, probably even earlier period, | 
as might indeed be concluded from the relatively slight similarity remain- | 
ing between them. Of course the mere presence of two chains must not, by | 
itself, be taken as evidence that the divergence of the chains is ancient. 
Thus the growth hormone of man is single-chained, while that of cattle is, 
apparently, double (Table 1). Unless the single chain condition is the 
result of loss, this double-chain structure must be a relatively recent 
development.t 

On the basis of these conclusions I classify known protein sequences 
into two groups: recently divergent, mostly mammalian so the diver- 
gencies are not older, and preponderantly much younger, than the meso- 
zoic; and anciently divergent, which includes the two chains of insulin and 
hemoglobin and homologous sequences of cytochrome C from Rhodo- | 
spirillum rubrum, yeast and cattle. These cannot have diverged later than 
the Devonian, and in the case of the cytochrome certainly vastly earlier.+ 


+ An examination of homologous sequences in Table 1 shows that, besides the well-| 
known phenomenon of replacement of one amino acid by another, a deletion of residues 
can also occur, as first pointed out by Sorm and his collaborators (1957). This is clearly 
demonstrated by the sequences of the two chains of hemoglobin. The known sequences | 
of melanophore-expanding hormones appear to be fragments of an originally longer 
sequence, which has suffered breakage at various distances from both ends. A C-terminal 
deletion also seems to have occurred in fish insulin. 

{ Although in principle data on amino acid sequences are reproducible at will, in 
practice the great labor involved in obtaining them makes an independent verification 
difficult, so that most of such data must be accepted on the basis of the work of a single 
laboratory. Some parts of the sequence of ribonuclease, for example, depend indeed on a 
single experimental determination (Hirs, Moore & Stein, 1960). It is therefore legitimate 
to ask to what extent such data may be contaminated by experimental error. No general 
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It is now possible to consider whether there is, in fact, any difference 
between the distribution of replacements in recently and anciently 
divergent proteins. Replacements are considered to be all permutations of 
different amino acids which occupy one position in homologous proteins, 
listed in Table 1. Thus position g in the A chain of insulin is evidence for 
one replacement, serine-glycine; position 30 in the B chain for three, 
serine-threonine, serine-alanine and threonine-alanine. If one plots 
replacements in recently and anciently divergent homologous proteins on a 
grid, letting the hatched areas represent replacements prohibited as a 
result of an a mutation, one finds the distributions shown in Fig. 2. If 
replacements were random, they would be distributed between the two 
areas approximately in the ratio of 149 : 41. 

In fact, replacements differentiating the recently divergent proteins are 
distributed quite differently. Of 62 such replacements, only two (glutamic- 
glycine, hemoglobins A and G and alanine-valine, serum albumins) occur 
in the hatched area Fig. 2a). The probability that such a distribution is 
a chance deviation from what is actually a random distribution is very small 
(x? = 12-3), so that Fig. 2a provides strong evidence that avoidance of 
replacements prohibited as a result of a mutation is real. 

On the other hand, the distribution of 44 replacements differentiating 
anciently divergent proteins is in the ratio of 32 : 13 between the two areas 
(Fig. 2), which is close to a random distribution (y? = 1-4). Comparing 
the two distributions, one can draw the following conclusions: 


answer can be given to the question, but some idea of the possible size of error can be 
obtained by considering a few cases where experimental cross-checks have been made. 
Thus the peptide, now known to be N-terminal, of human hemoglobin A was originally 
stated to have the sequence shown in parentheses in Table 1 (Ingram, 1959) and later 
corrected to the sequence shown below (Hunt & Ingram, 1959). The differences are by 
no means small. As another example, the sequence of Tobacco Mosaic Virus protein as 
determined in two different laboratories is also shown in Table 1 (Anderer, Uhlig, 
Weber & Schramm, 1960; Tsugita et al., 1960). In this case it is not clear to what extent 
the differences may be due to the use of slightly different strains of virus, but certainly 
at least some must be the result of errors in one laboratory or another. As these examples 
are taken from the published work of established investigators with a reputation in their 
field, it is clear that a considerable number of errors may exist in the published sequences, 
introducing what one must consider as “‘noise”’ in the data. Such “‘noise” would, of course, 


tend to randomize the apparent distribution of replacements. 


+ If the occurrences of any replacement were equally probable, 0:2158 of all replace- 
ments should be found in the hatched area. However, the actual probability of any given 
replacement should be proportional to the product of the abundances of the two amino 
acids in the protein sample. Taking the average abundance of amino acids to be that 
given in Table V of my 1958 paper, the sum of such products occurring in the hatched 
area is 0°212, virtually identical with the number given above. Replacements forbidden as 
a result of a mutation are, therefore, not a class whose occurrence is a priori less probable 
on statistical grounds. On a random basis the probability that of 62 replacements two or 
less will occur in the hatched area is 0°000035. 
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1. Replacements differentiating the recently divergent proteins strongly 
avoid transitions which the coding hypothesis prohibits as a result of | 
mutation in the a component. 

2. Randomization of replacements differentiating the anciently divergent 
proteins indicates that the 8 component can also mutate, but at a much 
slower rate than the a component. 

3. The random distribution of replacement in Fig. 2) speaks strongly 
against the view that the non-random distribution in Fig. 2a is due to a 
selection process, which makes certain transitions unobservable 
because they result in an inviable organism. This suggestion is in any 
case implausible, since many replacements plotted in Fig. 2a involve 
amino acids of very different types, such as glutamic-lysine, or 
arginine-serine. 


The concrete assignment of replacements into two categories was based 
on considerations that had nothing to do with the replacement process 
(Yéas, 1960), so that the observed avoidance of replacements prohibited 
by a mutation is strong independent evidence in favor of the original 
coding hypothesis. Since the hypothesis was deduced from data on the 
composition of viruses, while the data on replacements is mainly from 
non-viral proteins, it is further evident that a ribonucleic acid analogous 
in function to that of a virus must be presumed to be present in uninfected 
cells as part of the normal protein specifying mechanism. Identification of 
the 8 component remains a problem for future research. 

This work has been supported by Grant G-g9753 from the National 
Science Foundation. 
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A new suggestion is made based on model work associated with similar 
measurements on the biological material itself. The primary quantum 
conversion act is an ionization occurring in a charge transfer complex 
This is what it amounts to in chemical terms. But this process cannot 
occur in isolated charge transfer molecules in solution because the pro- 
ducts cannot escape from each other. The primary quantum conversion 
as it occurs in modern photosynthesis can only take place in a laminated 
structure where the electrons and holes can escape from each other by 
electron migration and not by atomic migrations. This is the essential 
feature introduced here which differs from all the previous notions 
of how quantum conversion occurs in chemistry or biology. 


1. Introduction 


One can hardly begin a discussion of the problem of photosynthesis, or any 
specific aspect of it, without writing a small equation which will define and 
delimit the discussion. The overall reaction of photosynthesis, the reaction | 
by which green plants convert electromagnetic into chemical energy, is 
usually written in this form: 


CO, + H,O 2) (CH,O), + O, | 
The substances on the left-hand side of the equation (CO, and H,O) are | 


the elements of carbon, oxygen and hydrogen in their lowest energy form, | | 
and the substances on the right-hand side of the equation (carbohydrate 


and oxygen) represent these same elements at a higher chemical potential. | 


The carbohydrate and the oxygen normally, in the animal body and in the | 


plant too, for that matter, can back-react, producing carbon dioxide and |] 


water and, at the same time, liberate energy in one form or another— |] 
energy for growth, energy for heat, energy for whatever purpose the | 
organism might want it. | 

Certain aspects of this problem of energy conversion will not be the |} 


| The preparation of this paper was sponsored by the U.S. Atomic Energy Commission. | 
} Research Professor of Chemistry in the Miller Institute for Basic Research in Science, 
University of California, Berkeley, 1960-61. 


258 


QUANTUM CONVERSION IN PHOTOSYNTHESIS 259 


subject of this discussion, either because they have been resolved or because 
we know little about them. In the first category lies that part of energy 
conversion in which carbon passes from carbon dioxide into carbohydrates. 
With the use of tracer carbon, we have been able in the past fourteen 
years to draw an almost complete road map from carbon dioxide to the 
various chemical compounds that go to make up the plant (Bassham & 
Calvin, 1957, 1960, in press; Bassham, 1959), principally carbohydrates. 
In the second category is the conversion of the oxygen from water to 
molecular oxygen, at the opposite end of our knowledge level; we know 
nothing, really, about how the single oxygen atom in the water molecule 
finds another one and becomes an oxygen molecule—in other words, how 
the oxygen-oxygen bond is created. We have some ideas about it, but very 
few in contrast to what we know about the construction (the actual 
building) of carbon compounds. We know very little about how we put 
together an oxygen molecule (Dorough & Calvin, 1951; Anderson, Blass 
& Calvin, 1959; Sapoznikov, Eidelman, Bazhanova & Popova, 1959; 
Mason, 1957). 

In between these two phases of our knowledge of the process of photo- 
synthesis and energy conversion lies the area of the present discussion. It 
is the aspect in which the electromagnetic quantum (the light quantum) is 
absorbed by the chlorophyll to give an excited electronic state of chloro- 
phyll, and then something happens to this excited electronic state, during 
which time it is converted into chemical potential, definite molecular 
species which, upon back-reaction, could liberate energy. That particular 
step is the primary concern of this paper. 

To isolate for consideration that step from the equation as it is written, 
we may describe the events as follows: 


Chen Se ah 


Giienie = sero seth) 


The quantum is first absorbed by the chlorophyll molecule; then some- 
thing happens (p for primary) to the excited chlorophyll to produce two 
chemical species ([O] and [R,] for example) which later can go on, one of 
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them [O] to become molecular oxygen in some way (1), and the other one 
[R] leading to the reduction of carbon dioxide to carbohydrate (2). Along | 
these two routes various other energy-containing species may be created, 
such as phosphoric anhydride (ATP or ~ P). A phosphoric anhydride — 
species, represented by ATP, would, of course, be an energy storage | 
product. These may be created on either, or both, sides. Further than ] 
that there may even be back-reaction (3) between these intermediates 

(oxidants and reductants) which also could create various products of J 
higher energy. The obvious one to use here is, of course, the pyrophosphate _ | 
linkage. The creation of a pyrophosphate linkage of this sort in a water | 
milieu is storing energy. 


2. Photochemistry of Chlorophyll 


No attempt will be made to describe the biochemical detail of any of the — 
steps beyond (p). This paper will be limited to the very first thing that 
happens to the quantum after it has been absorbed by the chlorophyll 
molecule to produce an excited state of the chlorophyll. What are the very 
first forms in which stable (definable) chemical species different from 
electronically-excited molecules (such as excited chlorophyll) appear? 
I shall not be concerned with how the intermediate oxidant [O] becomes 
oxygen (Z) or what other intermediate oxidants might be, nor shall I 
consider what the hydrogen carriers might be which eventually reduce 
carbon dioxide to carbohydrate (2) or how, along the line (2) as they drop 
in potential, they might produce other high energy containing materials 
such as ATP. The recombination (3) oxidant and reductant which might 
also occur as succeeding chemical steps, will also lie outside our present 
concern. My concern is the immediate fate of the excited chlorophyll and 
what could possibly be the very first of these species here called oxidants 
and reductants. 

In order to try and get some idea of what could happen to the excited | 
chlorophyll, two additional ideas will be introduced. First of all, I shall | 
examine the biological apparatus that performs this operation (insofar as | 
we know what molecules that biological apparatus is made of and how it | 
is constructed), and, secondly, I shall explore some model experiments 
which are based upon what we believe is the construction of this biological 
apparatus. (This latter is almost exclusively physical chemistry or physical- | 
organic chemistry.) Then I shall go back and apply the concepts that are 
devised from the combination of the structural information and our model | 
researches, to the biological material itself, experimental observations on | 


the biological material designed to simulate or reproduce the observations 
that were made on the model systems. 


" Sn nn nena 
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PHOTOCHEMISTRY OF CHLOROPHYLL IN SOLUTION 


Before going into the details of this, it seems worthwhile to introduce the 
point of view that dominates these discussions. From the very beginning 
of our knowledge of the structure of chlorophyll (in 1911 Willstatter & 
Stoll (1939) first had a good idea of what the structure was), chemists, 
biologists and biochemists tried to understand the photochemistry of 
chlorophyll itself. As they extracted chlorophyll from leaves of green 
plants and worked on the structure of it, they studied its photochemical 
behavior as well. The Fischer formula has since been confirmed com- 
pletely (Woodward et al., 1960), and we can now proceed with complete 
confidence in it. 

Over a period of some forty years photochemists made a wide variety of 
experiments in an attempt to see how the energy of a 40 kcal quantum 
(which is what is involved here) could be converted in a single act into 
chemical potential. An enormous literature (Gaffron, 1933; Schenck, 
1957; Krasnovsku, 1960; Livingston, 1960) exists on the photochemistry 
of chlorophyll and models of it. A great many attempts have been made 
to find ways in which the energy of 40 kcal in an excited electronic state 
might be used in a single act to create two chemical species that potentially 
could back-react with about 40 kcal—in other words, to store almost all of 
that 40 kcal. Even if only 35 kcal were stored, that would be a lot to store in 
particles created at the same point. This search has not been successful, 
in spite of forty years work, and the many men’s lives involved in it. The 
attempt to find a chemical reaction, either sensitized by chlorophyll or by 
any of its analogs or by model substances representing it, in which the 
energy of 40 kcal could be converted into a pair of chemical species storing 
something of the order of 30-35 kcal (the efficiency of this process must be 
very high) has not succeeded. 

In retrospect, it is not very surprising that it should have not yet 
succeeded. If this energy conversion process is to take place in chlorophyll 
molecules which are simply moving about at random in ordinary solution, 
and in contact with a variety of molecules with which they could react and 
to which they could give energy, it is necessary to create, in one operation, 
a pair of energy-rich species A and B.t Then A + B by definition, in their 
back-reaction have 35 kcal of energy to set free, and they have to be created 
in one act right on or near the chlorophyll molecule. Some rather tricky 
kinetics must be involved. Most chemical reactions do not have activation 
energies that high; usually they are only around 20 kcal. To store 35 kcal 
from the starting point (let us define A-B as the starting point: this could 


+ These may be in different parts of the same molecule in which case the photoreaction 
night be called a rearrangement. 
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be a molecule or molecular system) the end product, A + B, has to be | 
35 kcal above it. If this product is not to return immediately, there has to } 
be a barrier between it and the starting point so that the system will not 
fall back immediately in the back-reaction. This cannot be done; if we } 
are to store 35 kcal and we have only 4o kcal in the quantum with which | 
to do it the barrier cannot be more than 5 kcal high and the back-reaction | 
would be too fast. Essentially the problem is: To separate the products 
which are themselves of high potential energy for reaction before back- 
reaction can take place. This is very hard to do in ordinary statistical 
chemical reactions. In fact, it has not yet been done. 

There are a number of cases in which the photochemist has succeeded 
in storing energy in a straightforward photochemical reaction in solution, 
but, in general, those storages are very small, a few kcal at most, and 
40-60 kcal quanta are used to accomplish this. The situation, therefore, is 
just the reverse of the natural reactions of chlorophyll. Instead of the 
product being 35 kcal above the starting point, it is only 5 kcal, with a 50) 
kcal quantum to help, and the barrier can be quite high (45 kcals by these 
numbers). One can succeed in that kind of storage problem. 

The point of view adopted here is that this 35 kcal energy storage is not 
the result of ordinary statistical photochemistry in solution, but rather is | 
the result of a photophysical process in an organized solid or quasi-solid 
matrix. How this is achieved in this case, in contrast to solution chemistry, 
will be the substance of this discussion. We did model work to show that 
this was possible in model systems. We then went on to ask if the phenom- 
ena we see in the model systems could be reproduced in the biological 
material itself. 


3. Photophysical Effects in Model Systems 
(A) ENERGY TRANSFER IN MODEL SYSTEMS 


One of the factors that contributed to the adoption of this viewpoint was | 
the examination of the structure of the biological apparatus that accom- | 
plished the energy conversion (Steinmann & Sjostrand, 1953; Frey-| 
Wyssling, 1957). Plate I shows the chloroplast of a green plant in which this 
energy transfer occurs. The chloroplasts contain the chlorophyll, and it is, 
in these (a few microns in size) that the energy conversion process occurs. | 
Plate IT is an electron micrograph of a single chloroplast, at much higher | 
magnification, which shows the internal structure of one of the chloroplasts | 
shown in Plate I. There is a very high degree of organized structure to be. 
seen inside the chloroplasts. The dark areas are the so-called lamellae 
which are present in all photosynthetic organisms. In this particular one | 
(tobacco) these lamellae are arranged in stacks, and the term ‘“‘granum’’ has 
been applied to a single one of these ellipsoidal packages which can be. 


Pate I. Cells of liverwort showing chloroplasts. 


b 


Piate II. Tobacco Chloroplasts. 24-36 hr. in dark before fixing with permanganate 
(Weier, University of California, Davis). 
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separated from the chloroplasts. There is, then, a high degree of order to 
be found inside the chloroplast. In fact, if one takes a smaller section of this 
granum at still higher magnification, one can see that these are made up of 
what look like little oval sacks pressed together. The darkest areas appear 
to be the contact areas between the two surfaces of completely enclosed 
oval or ellipsoidal sacks. 

Figure 1 shows a diagram of our concept of what the layers of the chloro- 
plast are composed of (Park & Pon, 1961). Each of the dark areas represents 
a contact between the surface of two of the ellipsoidal packages. We 
believe, for the moment at least, that these are the areas in which the 
chlorophyll is located. We do not know as yet what the orientation of the 
various pigments (the chlorophyll and carotenoid pigments which are 


Intergrana area —> | «— Grana area 


Fic. 1. Model for chloroplast lamellar structure (Park & Pon, 1961). 


present in this structure) is with regard to the particular boundaries that 
are visible in the electron microscope. This is one of the major problems 
which the physical chemists working with macromolecular materials must 
begin to solve. 

We believe that, because a high degree of order is visible, insofar as we 
have been able to see it, we shall find (when the research apparatus and 
facilities are able to carry us further) that this high degree of order is 
actually carried right down to the molecular level, right down to the chloro- 
phyll plane itself and the carotenoid as well. While we could draw a 
picture as to what we think the molecular orientations may be, these 
pictures are in a large measure conjecture (Calvin, 1958). There are 
certain bits of evidence such as polarization, absorption and fluorescence 
which lead us to believe that the chlorophyll planes are oriented in these 
particles. The strongest argument, however, is our conviction that there 
must be order down to the molecular level so as to provide order at 
this higher level (Calvin, 1961). 


264 MELVIN CALVIN 


There is another bit of evidence which is seen in Fig. 2. This shows the 
absorption spectra of chlorophyll in various states: in solution; in a 
colloidal state; and in a crystalline layer. The absorption spectrum of 
chlorophyll in the plant itself resembles the latter two more than the first ] 
one. 


——"IN SOLUTION" 
---- COLLOIDAL 
—-— CRYSTALLINE 


OPTICAL DENSITY 
9 fe) 
rs 


a2 


400 500 600 700 800 
WAVELENGTH — my 


Fic. 2. Absorption spectra of chlorophyll in various states (Jacobs, Holt & Rabino- 
witch, 1957). 


Evidently the plant chlorophyll is not chlorophyll in solution: it is lipid, | 
protein and chlorophyll (with other pigments) in a tight package; in a 
semicrystalline form. I am not emphasizing the spectrum itself as the only 
bit of evidence, but simply as one piece indicating the ordered array which 
the chlorophyll in the chloroplast itself is likely to turn out to have when | 
we know it. 


(B) RELATIONS BETWEEN CHLOROPHYLL, PROTOCHLOROPHYLL AND 
BACTERIOCHLOROPHYLL 


What is the molecule we are talking about? Fig. 3 shows three of the | 
chlorophylls with which we are normally concerned. The middle structure | 
shows chlorophylls @ and 5; chlorophyll a has a methyl group in the | 
3-position and chlorophyll b has a formyl group (formaldehyde) in that | 
position. Bacteriochlorophyll is found in all the photosynthetic bacteria | 
which do not make oxygen but which do reduce CO,. The essential | 
difference between plant chlorophyll and bacteriochlorophyll is the fact | 
that the latter has two extra hydrogens on the opposite pyrrole ring (at 
positions 3 and 4) as compared to a double bond for the plant chlorophyll; | 
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the total redox level remains the same, since the 2-vinyl group is now 


oxidized to acetyl. The hydrogen atoms are just at a different place. In 


both the plant chlorophyll and bacteriochlorophyll, the macrocycle re- 


mains conjugated, but it is somewhat more limited in the bacterio- | 


chlorophyll. 
Protochlorophyll belongs to the class of compounds known as porphy- 
rins; it is dehydrogenated at positions 7 and 8 compared with chlorophyll, 


but that is the only difference between them. Protochlorophyll appears in — 


etiolated plants, that is, plants grown in the dark from seed and which 


have never seen the light. Protochlorophyll is converted into chlorophyll | 


immediately upon illumination (Smith & Coomber, 1960). These “extra” 


hydrogens, the 7 and 8 pair and the 3 and 4 pair, are the two points of the 
chlorophyll that workers (including ourselves) have focused their attention 
on for the last twenty years in an attempt to do solution photochemistry 
(Seely & Calvin, 1955). We thought that perhaps one or the other of 
these pairs of hydrogen atoms was being transferred back and forth by the 
photochemical reaction, but now the evidence seems to indicate that this 
is not the case and the chlorophyll does not function in this way. 

The main feature of the chlorophyll structure is the big conjugate 
macrocycle, the so-called dihydroporphyrin ring (chlorin ring) which is 
the light-absorbing entity of the photosynthetic apparatus. This is what 
makes plants green. The phytol side chain seems to be part of the archi- 
tecture that holds the molecule in place. I do not believe the phytol chain 
plays a part in the energy transmission, at least not directly. The 6800 A 
-40 kcal quantum is absorbed by the electronic system of this conjugated 
macrocycle with the magnesium in the center, and from there we do not 
know what happens. This is what we are trying to discover and are 
speculating about. 


Presumably, a very similar process goes on in the bacteria with the | 
bacteriochlorophyll, the difference being that, in the bacteria, oxygen is not | 
liberated. The primary oxidant is instead reduced by some chemical | 


reducing agent other than water. 


So much, then, for what we know about the biological equipment that | 


is to perform the energy conversion that was described earlier. I have not 
mentioned the accessory pigments of which there are several and of which 
at least one will probably turn out to be as important as chlorophyll. 
People generally overlook this, although a little thought reveals that it 


shouldn’t really be overlooked. Wherever there is chlorophyll, wherever | 


there is photosynthesis, there is also carotenoid. In general, people have 
tended to ignore this, or at least have not given enough weight to the fact 
that somehow carotenoid and photosynthesis must be very closely associ- 
ated. The carotenoid is the long conjugated carbon chain (polyisoprene 
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with 10 to 12 double bonds in it and some oxygen at each end), and a 
variety of functions have been proposed for it: oxygen carrier (Dorough & 
Calvin, 1951), electron carrier (Calvin, 1958; Platt, 1959), or hydrogen 
carrier (Calvin, 1959a; Shlyk, Godnev, Rotfarb & Lyakhovich, 1957), and 
probably one of them is right, but the difficulty is to know which one. 

With this structural background on the photobiological apparatus, let 
us turn first to the question of generating an idea as to how it might work 
(other than ordinary solution photochemistry) in the solid state, i.e., the 
organized state which very certainly exists. Then I shall describe some of 
the model experiments that have been done in an attempt to expand, or 
explore, the concepts that were generated by the combination of knowing 
the fact that there is such a fine structure; that the flat chlorophyll molecules 
tend to lie one upon the other; and that there is a difference between the 
way the crystal or pseudo-crystal behaves and the way the molecules in 
solution behave. 


(c) PHTHALOCYANINE AS A MODEL FOR CHLOROPHYLL ENERGY TRANSFER 


About 1950 the developments in solid state physics finally reached the 
chemists. By this I mean the developments in our knowledge of the 
electrical and magnetic properties of atomic and ionic crystals had reached 
a stage, both of technical development and understanding, which allowed 
{ us to apply some of the notions that were common amongst the physicists 
| developing this work to the kinds of molecules and the kinds of systems 
that we had in this biological apparatus, particularly the big, flat aromatic 
systems such as chlorophyll. 

I had for some years been working with porphyrin analogues. The first of 
these, and the one that is still one of the most popular, I encountered in 
1936, the year it was discovered in England, and this is the molecule of 
| phthalocyanine. It is a synthetic compound that resembles, in some respects, 
' the structure of the tetrapyrrole visible in chlorophyll. Phthalocyanine 
differs from chlorophyll in certain rather important aspects, but the most 
| important difference is that it is easily made compared with chlorophyll; 
' it is also easily handled and very stable, and neither of these qualities is 
) true of chlorophyll. This is the reason we selected phthalocyanine as a 
+ model of the porphyrin structure found in the chlorophyll in an attempt 
’ to find out how the solid array of molecules might differ in their physical 
} and chemical properties and reaction to light from molecules in solution. 
| The structure of phthalocyanine was determined in 1935-36 by Lin- 
} stead (Linstead, Eisner, Ficken & Johns, 1955) at Imperial College. It is 
} shown in Fig. 4. It is made from phthalonitrile and metal; the ring closure 
occurs very readily. It has the elements of the tetrapyrrole in it, but it 


i differs from a true tetrapyrrole in that the bridging atom instead of being 
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CH is nitrogen, so it is called a tetrazaporphyrin. It also has benzene rings 
fused on to the pyrrole rings. Phthalocyanine is a very stable substance 
and is widely used in various forms as a dyestuff. 

With this as our starting point we sought to make systems that might 
resemble the laminated system that appears to exist in the chloroplast. The 
idea that organic substances such as phthalocyanine might be electronic 
conductors under certain conditions was actually born, as far as I was 
concerned, in a discussion with Professor Michael Polanyi (University of 
Manchester) at the time we received the phthalocyanine from Linstead in 
1936. We did not do anything about it then except insofar as we used the 
phthalocyanine as a catalyst for hydrogen activation, much like platinum. 


Fic. 4. Structural formula of phthalocyanine. 


That was about the extent of my early activity with phthalocyanine as a 
possible electronic conductor (Calvin, Cockbain & Polanyi, 1936; Calvin, 
Eley & Polanyi, 1936). One of my associates in the laboratory at Man- 
chester, D. D. Eley, also working with phthalocyanine, went to work 
along the electronic lines, and some twelve years later he published the 
first paper, I think, on this subject, in which he demonstrated that phthalo- 
cyanine behaved as an organic semiconductor (Eley, 1948). 

This was enough to trigger us again, and now the basic idea was born 
that the energy conversion process in the chloroplast might be a process in 
which the excited chlorophyll molecule had some of the properties of an 
organic semiconductor. The transformation from an excited chlorophyll 
molecule into chemical potential was envisaged as separation of charge 
rather than a separation of atoms. We now had to devise that physical 
configuration of the molecules which might permit the demonstration that 
this phenomena could occur. 

The structure of the actual photosynthetic apparatus suggests a laminated 
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structure in which chlorophyll molecules are arranged in some order, 
perhaps with carotenoids and other lipid-type materials on one side. On 
one side of the chlorophyll layer there could be electron-accepting species 
and on the other side of the layer there could be electron-donating species. 
In this way one could visualize a laminated system resembling the donor- 
acceptor systems in the atomic and ionic lattices that the physicists had 
been describing, which did succeed in converting electromagnetic energy 
into charge separation in a fairly well understood manner. 

We proceeded to explore this idea and develop it to see what the limita- 
tions of it were and what the requirements were for producing charge 
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Fic. 5. Diagram of sample conductivity cells. 


separation in an organic system using light. First, we had to show that the 
material was indeed a semiconductor. We performed the same experiments 
that Eley had done and came out with much the same general results. The 
next step was taken when we started to construct laminated (layered) 
structures in which we added either electron donors or electron acceptors 
to the phthalocyanine (chlorophyll analogue) layer (Kearns & Calvin, 
1958; Kearns, 1960; Kearns, Tollin & Calvin, 1960). Our first measure- 
ments were purely of conductivity: Could these layers carry an electronic 
current in the dark? What would happen to the conductivity of such a 
system if one put donor or acceptor layers together in such a configuration? 

Fig. 5 shows the diagram of the apparatus that was used to perform these 
experiments. The electrode system shown here was actually an inter- 
leaving of two aquadag combs, and lying on top of it, by sublimation or 
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evaporation, was the layer of the sample. We have performed the experi- 
ment with phthalocyanine and with about half a dozen other aromatic 
pi-electron-containing systems. The lamination was achieved by putting 
on the back surface of the sublimed layer the donor or acceptor system, 
whichever it might be. Most of the work on the phthalocyanine and on the 
other aromatic systems (violanthrene, perylene, etc.) was done with electron 
acceptors as the top layer (Kearns & Calvin, in press). 

The results of such an experiment are shown in Fig. 6 in which we plot 
the log of the current flowing between the two electrodes (maintained at a 
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Fic. 6. Variation of dark conductivity and photoconductivity of phthalocyanine with 
amount of o-chloxanil added. 


50 to go V differential) as a function of the amount of electron acceptor 
which was put on top of the phthalocyanine layer. This, then, is the current 
flowing between the electrodes, i.e., through the phthalocyanine, as it is 
affected by the electron acceptor which is placed on top. The conductivity 
of this system rises very steeply as very small amounts of electron acceptor 
(o-chloranil) are added to the surface layer. This is true of the dark current 
and also of the photocurrent, which is the difference between the light 
current and the dark current. The o-chloranil (o-tetrachloroquinone) is a 
very good electron acceptor: as a small amount of the electron acceptor is 
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placed above the phthalocyanine layer, the conductivity goes up by several 
powers of ten. 

Apparently the acceptor pulls electrons out of the donor, putting elec- 
trons into orbitals of the o-chloranil and leaving behind electronic vacancies 
in the phthalocyanine molecules. By putting a potential between the two 
electrodes, it becomes possible to move charge much more readily between 
them because there are now low-lying, unoccupied orbitals between which 
the electrons from the full orbitals can move. The electronic state in the 
organic solid after any particular move is the same as it was before, save for 
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Fic. 7. Schematic representation of donor and acceptor molecules and ions imbedded 
in a donor layer or an acceptor layer, respectively. From this diagram it is clear that 
process (1), the transfer of an electron from an acceptor negative ion to a neutral neighbor, 
produces a state of the system which is energetically identical with the initial state. 
Similarly, there is no net change in energy as a result of process (2) which rearranges 
charge in the donor layer. In the case of a neutral free radical, however, the electron 
transfer process (3) does not result in a state energetically equivalent to the initial state. 
Since processes (1) and (2) simply change the location of negative and positive charges 
respectively, with no net change in energy, we can consider the orbitals involved in the 
electronic rearrangements as forming conduction bands. If, however, the lattice were made 
up of A~ radical ions (no A’s) irrespective of the cations, or entirely of D* radical ions (no D’s) 
irrespective of the anions, there would be no identical vacant orbitals into which the charge 
carriers could move and hence no conduction bands (however narrow). This last situation 
would correspond to the completely filled free radical system as in process (3) above. 


the passage of electrons from one electrode to the other. Without these 
vacancies for “‘hole’’ motion in the donor layer (electron motion in the 
acceptor layer), the conductivity would be very low (Keppler, Bierstedt & 
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Merrifield, 1960). A diagram representing this situation is shown in Fig. 7 
(Kearns & Calvin, in press). 

The light effect involved in the excitation of phthalocyanine to an 
excited state leads to a higher population of electrons in the acceptor 
molecules, making a higher population of electronic vacancies in the donor 
matrix so that the conductivity increases over that in the dark. 

This is essentially the basic notion which we believe describes the model 
system as we now have it. We have used a wide variety of donor systems 
and a considerable variety of acceptor systems, and the behavior has 
fulfilled all of the expectations of such a description (Kearns, Tollin & 
Calvin, 1960; Kearns & Calvin, in press). 
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Fic. 8. Electron spin resonance spectrum of o-chloranil “doped” phthalocyanine. The 
curve represents the first derivative of absorption. 


There are various other properties of such a system which should follow, 
and we have measured them. For example, we have measured the kinetics 
of the photoconductivity—how it grows and decays—at various tempera- 
tures. One observation is particularly interesting, and it has to do with 
the fact that in a system of this kind, the electrons in the acceptor layer are, 
in effect, unpaired electrons. They may be considered as in very narrow 
conduction bands, i.e. they are in singly occupied orbitals in the molecules. 
The same things may be said of the unpaired electron that remains be- 
hind. One should see those unpaired electrons by virtue of their magnetic 
spin resonance and indeed we have seen them in that way. Fig. 8 shows 
the electron spin resonance spectrum of o-chloranil “doped” phthalo- 
cyanine; the g value is very close to that of a free electron.t Fig. 9 shows the 
change of that signal following illumination and darkening. When the light 


. . . —— 
_ t g value is the gyromatic ratio defined by B=ygH where p is induced [dipole’ and TH 
is the magnetic field. 
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is turned on, the spin signal is decreased and when the light is turned off, 
the spin signal comes back. The reason for that in this particular situation 
is that almost all of the o-chloranil molecules adjacent to the phthalocyanine 


are already mono-negative ions in the dark, and, when the light is turned on, 
Light on 


Light off 


SPIN CONCENTRATION ——> 


le Min >| 
. Effect of illumination on the electron spin resonance signal of o-chloranil 


IG. 
“doped” metal-free phthalocyanine. Curve represents unpaired spin concentration 
v. time. 


a second electron is transferred so they become di-negative ions. Thus, 
there is a decrease in the total number of unpaired spins in the light. 
However, we do have systems that go the other way, because the equili- 
brium distribution is different. This depends on the relative orbital 
energy levels of the two systems, and we can get effects of this type ranging 
between photodecrease and photoincrease of unpaired spins. 

Figure 10 shows how separation of charge can be accomplished in this 
model system if it is properly constructed. Here is a matrix of phthalo- 
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Fic. 10. Schematic diagram of polarization apparatus. 
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cyanine, the surface of which is an o-chloranil layer. ‘There will be some 
negative charge trapped in the o-chloranil (acceptor) layer, and the positive 
charge will remain in the phthalocyanine (donor) layer. This will induce a 
polarization in the pair of electrodes between which the double layer is 
placed, and the polarization will be increased by shining light absorbed by 
phthalocyanine on the double layer, resulting in an additional accumula- 
tion of negative charge in the quinone and positive charge in the phthalo- 
cyanine. This is photochemically-induced separation of oxidizing power 
(positive holes) and reducing power (o-chloranil double negative ions), and 
presumably this kind of thing can occur in the individual layers that are 
seen in the chloroplasts. 
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Fic. 11. Semilog plot of time dependence of photoconductivity, light-induced electron 
spin resonance and light-induced polarization in doped phthalocyanine. 
We have studied the kinetics of various effects, the conductivity, the 
polarization, the electron spin resonance, and they’are all apparently the 


result of the same process. Fig. 11 shows the kinetics of these three 
phenomena. 
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The entire system and all of the processes can be described by the series 
of reactions shown in Fig. 12. In the dark, the o-chloranil and phthalo- 
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Fic. 12. Reactions of a solid matrix of Pc with a film of 0-Q. 


cyanine react to form a pair of radical ions (Fig. 12-1); in the light at 
7,000 A there is another transfer to form a double negative ion (Fig. 12-2) 
In the dark it goes back (Fig. 12-3). At 4,000 A, where the semi-quinone 
anion absorbs, we can excite this molecule and transfer an electron back 
into the phthalocyanine layer, which then leads to recombination and we 
get a decrease in conductivity (Fig. 12-4). 

This concludes what I want to say about the model experiments. I 
think it is clear from the model systems that organic substances of at least 
one type (large, aromatic molecules) can be semiconductors and photo- 
conductors; and, what is more, by suitably adjusting the combination of 
donor and acceptor systems, one can make from them a laminated structure 
in which it is possible to demonstrate the separation of charge induced by 
the absorption of light, the very thing that we were postulating might 
occur in the chloroplasts. 


4. The Relation to the Photosynthetic Apparatus 
The remainder of the discussion is an attempt to see how many of the 
kinds of measurements that were performed on the model systems we can 
perform on the biological material, and how truly these measurements 
tell us what goes on in the biological material in the same manner as they 


tell us what goes on in the model systems. 
The one thing that is difficult to do in the biological material is the very 
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first measurement which we made on the model system, namely, the 
conductivity. In the model systems we could make the configuration to 
fit the electrodes big enough so that we could handle it. In the biological 
materials, these lamina or lamella are pretty small—of the order of 30 to 
60 A thick. So far, no one has succeeded in making electrode systems that 
can be placed on the individual lamella to measure the conductivity or the 
photoconductivity of such small single units and larger ones do not seem 
to be available. 

However, there have been conductivity measurements on dried chloro- 
plasts which show that the dried chloroplast preparations are indeed photo- 
conductive, but they are subject to questionable interpretation in such a 
complex system. Nevertheless, we shall take the measurements at their 
face value later on (Arnold & Clayton, 1960; Arnold & Maclay, 1958). 


(A) ELECTRON SPIN RESONANCE IN CHLOROPLAST MATERIALS 


One of the principal types of experiment that we have done entailed 
looking for the unpaired electrons that might be generated by the light in 
the biological system. We put the biological system inside a resonance 
cavity to see if any unpaired electrons were generated when the light was 
turned on to it. Very early we found out that there were unpaired electrons 
of this type. The first experiments were done with eucalyptus leaves in 
1956, but we found that the results were not reproducible because of the 
variability of the eucalyptus leaves themselves. Towards the end of that 
year, the same kind of observations were made at St. Louis (Commoner, 
Heise & Townsend, 1956; Commoner et al., 1957). We ourselves made 
some chloroplast preparations and investigated the same thing (Calvin & 
Sogo, 1957; Sogo, Pon & Calvin, 1957). This type of experiment can be 
done with whole organisms (whole bacteria or chromatophores which are 
the chloroplasts of bacteria) or with pieces of chloroplasts from the green 
plant. 

Figure 13 shows the light-produced signals from whole spinach chloro- 
plasts. Light of 40 kcal per quantum was directed on to these materials and 
there are not many chemical bonds that can be broken by as little as 40 
kcal. ‘The signal indicates the appearance of unpaired electrons. Any free 
radical will give this kind of signal. Most biological material that is under- 
going rapid metabolism will show signals of this kind; it is not necessary 
to have light shining on them. The question, therefore, is: What kind of 
unpaired electrons are these? Are these ordinary free radicals, or are these 
electrons produced in photoprocesses such as have been described in the 
earlier models? If there were chemical free radicals produced by some 
secondary reactions, one might expect that if the system were cooled 
enough, the chemical reaction might stop and only the physical process of 
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electron transport would remain. We attempted to do this by cooling the 
sample to —150° C but we still got light-induced signals. 

Figure 14 shows the kinetic behavior of such a signal for Rhodospirillum 
rubrum which use bacteriochlorophyll. This experiment was done in a 
series of different temperatures, and it was found that the signals changed in 
character with the variation in temperature. There is also a variation in the 
signal with time. At 25° C after the light is turned on, the signal rises just 
as fast as the apparatus will follow it and reaches its equilibrium value 
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Fic. 13. Light signals from whole spinach chloroplasts. 


immediately, and when the light is turned off, the signal drops as rapidly 
as the equipment will follow it. In other words, the rise time and decay 
time that we have so far been able to see are not intrinsic to the electrons 
but rather they are imposed by the limits of the apparatus. As the material 
is cooled from 25° C to —15° C, a considerably larger signal appears, but 
there is a slow rising component in the signal. If the temperature is lowered 
still further to —55° C, some of the “extra” signal which is purely chemical 
(secondary, in other words) is frozen out, but not all of it: there is still a 
very fast rise followed by a slow rise, and the decay time shows the same 
characteristic—a fast decay and a slow decay. There are quite clearly 
several different kinds of unpaired electrons produced in this organism 


278 MELVIN CALVIN 


when the light is shone on it at —55° C. When the temperature reaches 
—160° C, we have none of the slow signals left at all, only the fast signals. 
Both the rise and decay are fast. 

This phenomenon is most readily interpreted by the obvious notion that 
we are first making a conducting type of unpaired electron which then is 
undergoing chemical change inside the biological material, also via one- 


_ Fic. 14. Electron spin resonance signals from Rhodospirillum rubrum; 5 min continuous 
illumination. 
electron reactions. We are seeing at room temperature and intermediate 
temperatures not only the physically-produced charge separation but 
chemical radicals as well, and as we cool the solution we freeze out the 
chemical reaction and have left only the physical process itself (Calvin, 
1959b). 

We really need something more to characterize the unpaired electrons. 
The rate of growth and decay, temperature dependence, etc., is not 
enough to identify these electrons as physically-produced instead of 
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chemically-produced. So far, the g values, i-e., the magnetic characteristics 
of the electron, appear to be those of free electrons, that is, electrons that 
are free to move around within the molecule and within the lattice. 

We have tried to use one or two other ways of characterizing the electron, 
such as looking for hyperfine structure, that is, looking for the interaction 
of the unpaired electron with specific nuclei, but so far this has not been 
successful. Either there are so many nuclear hyperfine interactions as to 
overlap, or the quasi-solid matrix broadens the lines so that no very 
useful resolution has yet been possible (Commoner et al., 1957). 


(B) APPARENT SPECTRAL EFFICIENCY 


The next characterization after the kinetics and the g value was the 
efficiency with which light produces the spin signals—the quantum 
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Fic. 15. Absorption and action spectra of chloroplasts (Sogo, Carter & Calvin, in 
press) I, = 101° quanta/sec. Band width = 100 A. 


efficiency for the production of these electrons. This is in the same vicinity 
as the quantum efficiency of photosynthesis, 1.e., of the order of one to 
one-tenth. Greater precision is difficult to obtain at present. 

The quantum efficiency with respect to wavelength is the next question. 
How does the quantum efficiency vary with wavelength? This type of 
measurement is somewhat easier to perform. The values that are here given 
are not absolute. The relative value for the production of electrons at one 
wavelength compared to the value for the production at another wave- 
length is compared with the absorption of chlorophyll. Figure 15 shows the 
action spectrum for the production of free electrons and the absorption 
spectra for the chloroplasts. It looks as though a minimum action occurs at 
a place where the absorption is greatest. This turns out to be what one 
would expect, judging from the configuration of the system. We used a 
thick layer of chloroplasts so that all the light was absorbed, and in those 
regions in which the light is most strongly absorbed, the concentration of 
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separated charges is the greatest and the recombination occurs at its 
fastest rate (Sogo, Carter & Calvin, in press). Since we are seeing pro- 
duction minus recombination, we see a minimum at the highest concen- 
tration of production. There is probably another effect also contributing 
to this shape for the ‘‘action” spectrum. It is possible to show by com- 
binations of different wavelengths that one can get more than additive 
effects and less than additive effects for the sum of two or more different 
wavelength illuminations. 

This idea of additive effects of light of varying wavelengths is a con- 
stituent part of the development of our knowledge of the behavior of plants 
with respect to light: it is known as the Emerson effect. In simplest terms 
it may be defined by the following observations; measure the number of 
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Fic. 16. Absorption spectrum (Shibata, Benson & Calvin, 1954) and action spectrum 
(Sogo, Carter & Calvin, in press) of Rhodospirillum rubrum. I, = 5 X 1014 quanta/sec. 
Band width = 66 A. 
molecules of oxygen produced per quantum of red light; measure the 
number of molecules of oxygen produced per quantum of green light; and 
then put both the red and the green light together on the same plant. This 
can be done under circumstances such that when the two wavelengths of 
light are together on the plant, one gets more (or less) than the sum of 
the two separately. In other words, there is a collaboration of the two wave- 
lengths of light (Emerson, Chalmers & Cederstand, 1957). The experiment 
can be done under conditions where there is a negative collaboration of the 
two wavelengths (they cancel each other) depending on the light inten- 
sities and other conditions of the experiment (Govindjee, Rabinowitch & 
Thomas, 1960; Ichimura & Rabinowitch, 1960). 

The same type of experiment can be performed with the photo-induced 
spin signals, at low temperatures (Androes, private communication). This 
is One more reason to suppose that the spin signals that we see are indeed 
something very close to the quantum conversion process itself. 

Figure 16 shows the absorption and action spectrum for the purple 
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acteria, and you can see exactly the same relationship between the absorp- 
ton and the action (Shibata, Benson & Calvin, 1954). 
If we have achieved separation of charge in the molecular lattices and if 
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Fic. 17. Delayed light emission from different biological materials. 


he charge is allowed to recombine, light can be emitted at low tempera- 
ures. Figure 17 shows the delayed light emission from spinach chloroplasts, 
Yhlorella pyrenoidosa, and Nostoc. The wavelength distribution is what 
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one might expect, and also the kinetics of the decay of this light emission 
are exactly the kinetics of the decay of the spin signal (Tollin & Calvin! | 
1957; Tollin, Fujimori & Calvin, 1958a). 7 || 

Two pieces of work that have been done by W. Arnold at the Oak Ridge; 
National Laboratory are important here (Arnold & Maclay, 1958; Arnold 
& Clayton, 1960). In this case, Arnold was measuring the change in the | 
light absorption of chromatophores from Rhodopseudomonas spheroides | 
(purple bacteria) induced by illumination with a second light, usually off 
longer wavelength. Figure 18 shows the change in absorption at 4,200 Ay 
the change occurs at 300° K just as fast as the instrument can measure it.J 
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Fic. 18. Absorption changes at 4,200 A induced by illumination with A> 6,400 A 
(Arnold & Clayton, 1960). 


It decays relatively slowly because part of the decay is chemical and part ' 
is physical. I want to call attention to the part taken at 1° K (Fig. 18). At. 
that low temperature, there is very little chemistry going on, and the 
spectral change occurs just as fast as the instrument permits measure- 
ment, in fact, faster than the instrument will follow. Here is clear evidence 
that the light is introducing a physical change, a change that can only be 
motion of electrons and not of atoms. Figure 19 shows Arnold’s measurement 
of the photoconductivity of dried chromatophore film; again when the 
light is turned on, the conductivity increases very abruptly and then there 
is a very rapid drop. 

I hope that soon we shall be able to make conductivity measurements in 


the radiofrequency range which do not require direct electrode con- 
nections. 
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Fic. 19. Decay of transmission (X) at 6,100 A, T = 77°K, following illumination with 
A > 6,400 A of chromatophores from R. spheroides (Arnold & Clayton, 1960). 


(Cc) QUANTUM CONVERSION IN BIOLOGICAL MATERIAL 


I want to draw a picture of what I think is the primary quantum con- 
version process that goes on in that layer of chlorophyll, and other pig- 
ment, in the lattice. We know a little about the chemical composition of 
the chloroplast itself: it is a lipoprotein together with pigments. There are 
a number of specific molecules which are present in the chloroplast, two 
of them being chlorophyll and carotenoid. There are two other rather 
important molecules which are present in large amounts in the chloroplast 
and which have an important bearing on energy conversion. These 
systems require not only the presence of the absorber but the presence of 
an acceptor molecule for electron transfer to occur, and to finish this 
process we must have something present as a donor molecule. The other 
two species that are well established in the chloroplasts are (1) a very 
important quinone called plastoquinone (Bishop, 1959; Crane, 1959; 
Lester & Crane, 1959; Crane, Ehrlich & Kegel, 1960), and (2) a variety of 
molecules which might be donors. There is one particular type of the 
latter which I would like to select as a very likely donor molecule, namely, 
the iron heme (cytochrome) species which are always present in the 
chloroplasts and chromatophores (Kamen, 1956). 

Figure 20 shows the two boundaries of the pigment layer. Chl are the 
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chlorophyll molecules in some array, possibly including carotenoids. ‘The 
first act of photosynthesis is, of course, the absorption of the quantum by. 
the chlorophyll molecule to produce an excited chlorophyll molecule. If | 
this were a perfect atomic or ionic lattice, there would be absorption by | 
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Fic. 20, Schematic arrangement of chlorophyll and possible donor and acceptor 
molecules in the chloroplast. 

The system in the chloroplast might structurally bear some resemblance to the model 
shown here, the chlorophyll having associated with it on the one side the electron acceptor, 
plastoquinone, in a lipid environment, and on the other side electron donor materials, 
such as the cytochromes, in an aqueous environment. Following the absorption of aj 
quantum in chlorophyll (eq. 1) it will migrate by resonance transfer to a suitable site near 
the quinone where electron transfer to the quinone will take place (eq. 2). The resulting 
vacancy can migrate by hole diffusion, that is, electron transfer from normal chlorophyll, 
into the vacant orbital of the neighboring chlorophyll positive ion. This process is the one 
that most nearly resembles the properties of a semiconductor and it permits the oxidant 
(chlorephyll positive ion) to separate from the reductant (electrons in the quinone 
orbitals) by a very nearly temperature-independent process. The oxidant then captures 
an electron from a suitable reducing agent, such as ferrocytochrome, thus producing a. 
ferricytochrome and regenerating normal chlorophyll (eq. 3). 
| 
| 


the entire lattice, but this is not so. It is a molecular lattice, in which inter- 
actions between molecules are relatively small compared with the inter- 
actions between atoms in germanium or ions in cadmium sulfide. The 
result is that the migration of this exciton occurs by resonance transfer. 
between neighbouring chlorophyll molecules until it arrives at one that. 


| 
| 
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is bound, or adjacent, to an electron acceptor such as quinone. The quinone 
of which I am speaking, i.e. plastoquinone; is one which was found in the 
chloroplasts as early as 1955 by Kofler (Kofler et al., 1959) and it has since 
been shown to be uniquely characteristic of the chloroplasts and not of 
other parts of the plant or cell. The plastoquinone is closely related to a 
similar quinone known as ubiquinone which is found in the (nonphoto- 
synthetic) mitochondria of plants and animals (Morton, 1958; Laidman, 
Morton, Paterson & Pennock, 1960). 

Let us use the quinone as a likely electron acceptor; there is one plasto- 
quinone molecule present for about 400 chlorophyll molecules. When the 
exciton reaches the chlorophyll molecule which is bound by a charge trans- 
fer complex to the quinone, ionization occurs, the electron is transferred, 
leaving behind in this chlorophyll molecule an electronic vacancy, or 
“hole”. At this point, we must introduce the idea of charge migration (see 
legend to Fig. 7). Up until now, energy migration has been by resonance 
transfer of an exciton. After ionization occurs, I suggest (require, in fact) 
that there is a migration by an electron from a neighbouring chlorophyll 
molecule to the “hole’’, so the “hole” moves down to the next chlorophyll 
molecule until it comes adjacent to a ferro-heme (cytochrome). When the 
“hole” reaches this point, electron transfer occurs from the iron (Chance & 
Nishimura, 1960; Arnold & Clayton, 1960) or another donor to neutralize 
it, and the pigment layer is returned to its original condition.t 

A separation of charge has been achieved, and the oxidized donor 
becomes an oxidant and the electron in the quinone 1s the reductant. The 
reductant can go on to reduce carbon dioxide (reaction (2), p. 4) and the 
oxidant can go on to generate oxygen (reaction (z), p. 4). ATP is required 
to help on the reduction of CO, and for many other energy-requiring 
operations. One possibility is that ATP may be generated during the passage 
of oxidant to oxygen (reaction (z), p. 4). ATP may also be generated on the 
reduction side (reaction (2), p. 4) and by recombination (reaction (3), 
p- 4) as well (see also the legend to Fig. 20). 


5- Conclusion 


What is the primary quantum conversion act? The primary quantum 
conversion act is an ionization occurring in a charge transfer complex. 
This is what it amounts to in chemical terms. But this cannot occur in 


+ The alternative step of electron transfer from donor to excited chlorophyll has also 
been suggested as a primary step (Kamen, 1961). A discussion of these two alternatives 
has led us (Calvin, 1961) to the present suggestion as the more likely of the two. The 
existence of two different primary quantum acts (Emerson, Chalmers & Cederstrand, 
1957; Govindjee, Rabinowitch and Thomas, 1960; Haxo, 1960; French, 1961; Allen, 
Piette and Murchio, 1961) makes it not unlikely that both of these two alternatives may 
occur, although either one alone would be sufficient to support the whole process. 
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isolated charge transfer molecules in solution because the products cannot 
escape from each other. The primary quantum conversion act as it occurs 
in modern photosynthesis can only take place in a laminated structure 
where the electrons and holes can escape from each other by electron 
migration and not by atomic migrations. This is the essential feature 
introduced here which differs from all the previous notions of how quantum — 
conversion occurs in chemistry or biology. 
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